(57) Abstract

Purified heparanase having activity of greater than 20 units/µg protein, preferably greater than 50 units heparanase activity per µg
protein, is described. The use of heparanase for screening for anti-heparanase compounds is also described. In addition, the use of the
high potency heparanase to accelerate wound healing or its use as an immobilized heparanase filter connected to extracorporeal devices to
degrade heparin and neutralize its anticoagulant properties during surgery is disclosed.
USE OF HEPARANASE TO IDENTIFY AND ISOLATE ANTI-HEPARANASE COMPOUND
FIELD OF INVENTION 
The present invention discloses the use of mammalian heparanase, preferably
recombinant heparanase, for screening for anti-heparanase compounds. More particularly, the
present invention provides a method of selecting IHA (Inhibitors of Heparanase Activity). In
addition, the present invention provides a purified heparanase, particularly suitable for use to
identify and isolate anti-heparanase compounds as well as for other known uses of heparanases,
such as its use to accelerate wound healing or its use as an immobilized heparanase filter
connected to extracorporeal devices to degrade heparin and neutralize its anticoagulant properties
during surgery.

BACKGROUND OF THE INVENTION 
Elevated heparanase activity has been documented in mobile, invasive cells. Examples
include invasive melanoma, lymphoma, mastocytoma, mammary adenocarcinoma, leukemia,
and rheumatoid fibroblasts. This activity has also been documented in non-pathologic
situations involving the migration of lymphocytes, neutrophils, macrophages, eosinophils and
platelets. An inhibitor of heparanase would therefore broadly influence the invasive potential of
these diverse cells.

Inhibition of heparan sulfate degradation would also inhibit the release of bound growth
factors and other biologic response modifiers that would, if released, fuel the growth of adjacent
tissues and provide a supportive environment for cell growth (Rapraeger, et al., Science 252:
1705-1708, 1991). Inhibitors of heparanase activity would be of value in the treatment of
arthritis, vascular restenosis, tumor growth and progression, and fibroproliferative disorders.

Until now, the obstacles to designing a screening assay to find inhibitors of mammalian
heparanase have been the unavailability of a mammalian heparanase that is purified to apparent
homogeneity and the lack of information about the amino acid sequence or the 3-dimensional
structure of the enzyme. Without the amino acid sequence, it has not been possible to produce
recombinant mammalian heparanase to be used in large volume screening efforts. Knowledge
of the tertiary and quaternary structures would facilitate rational design of IHA. This report
overcomes obstacles relating to the sequence of the heparanase, and also provides a model for
higher-order structure.

Heparanase activity in mammalian cells is well known. It is found in various melanoma
cells (Nakajima, et al., Cancer Letters 31: 277-283, 1986), mammary adenocarcinoma cells
(Parish, et al., Int. J. Cancer 40: 511-[...]; [...]
Research 12: 711-717, 1988), mast cells (Ogren and Lindahl, J. Biol. Chem. 250: 2690-2697,
1975), macrophages (Savion, et al., J. Cell. Physiol. 130: 85-92, 1987), mononuclear cells
(Sewell, et al., Biochem. J. 264: 777-783, 1989), neutrophils (Matzner, et al., J. Leukocyte
Biology 51: 519-524, 1992), T-cells (Vettel, et al., Eur. J. Immunol. 21: 2247-2251, 1991),
platelets (Haimovitz-Friedman, et al., Blood 78: 789-796, 1991), endothelial cells (Godder, et
al., J. Cell. Physiol. 148: 274-280, 1991), and placenta (Klein and von Figura, BBRC 73:
569, 1976).

WO 91/02977, incorporated herein by reference, describes a substantially, but partially,
purified heparanase produced by cation exchange resin chromatography and the affinity
absorbent purification of heparanase-containing cell extract. WO 91/02977 also describes a
method promoting wound healing utilizing compositions comprising a "purified" form of
heparanase.

Others have proposed the use of purified bacterial heparanase, immobilized onto filters
and connected to extracorporeal devices, to degrade heparin and neutralize its anticoagulant
properties post surgery (Freed, et al., Ann. Biomed. Eng. 21: 67-76, 1993).

U.S. Patent 4,882,318 describes heparanase-inhibiting compositions for preventing tumor
metastasis.

Haimovitz-Friedman et al. (Blood 78: 789-796, 1991) describe an assay for heparanase
activity that involves the culturing of endothelial cells in radiolabeled 35SO4 to produce
radiolabeled heparan sulfate proteoglycans, the removal of the cells which leaves the deposited
extracellular matrix that contains the 35S-HSPG, the addition of potential sources of heparanase
activity, and the detection of possible activity by passing the supernatant from the radiolabeled
extracellular matrix over a gel filtration column and monitoring for changes of the size of the
radiolabeled material that would indicate that HSPG degradation had taken place. This assay
does not have the capability for large-scale screening of inhibitors.

Nakajima et al. (Anal. Biochem. 196: 162-171, 1986) describe a solid-phase substrate
for the assay of melanoma heparanase activity. Heparan sulfate from bovine lung is chemically
radiolabeled by reacting it with [14C]-acetic anhydride. Free amino groups of the [14C]-heparan
sulfate were acetylated and the reducing termini were aminated. The [14C]-heparan sulfate was
chemically coupled to an agarose support via the introduced amine groups on the reducing
termini. [...]

Khan and Newman (Anal. Biochem. 196: [...]-376, [...]) describe an indirect assay for
heparanase activity. In this assay, heparin is quantitated by its ability to interfere with the color
development between a protein and the dye Coomassie brilliant blue. Heparanase activity is
detected by the loss of this interference. This assay is limited in use for screening because it is
so indirect that other non-heparin compounds could also interfere with the protein-dye reaction.

The CXC chemokine family (also called the intercrine α family) is one branch of the
supergene "intercrine" cytokine family (Oppenheim, Ann. Rev. Immunol. 9: 617-648, 1991).
Its members include platelet factor 4, platelet basic protein and derivatives, γIP-10, gro (α, β, γ),
NAP-1/interleukin-8, mig, and ENA-78 (for review, see Miller and Krangel, Critical Reviews in
Immunology 12: 17-46, 1992). The other branch, the CC chemokines or intercrine-β family,
includes MIP-1α, MIP-1β, MCP-1, RANTES, and MCAF. All members of both branches of
this chemokine family characteristically are basic heparin-binding polypeptides, display
molecular weights between 8 and 11 kD, share 20 - 50% homology, and function broadly in
pathologic situations characterized by inflammation and tissue remodeling.

The proteolytically processed forms of platelet basic protein include CTAP-III,
β-thromboglobulin, and NAP-2. β-thromboglobulin (Moore, et al., Biochim. Biophys. Acta. 379:
360-369, 1975) and CTAP-III (Castor, et al., Arthritis Rheum. 20: 859-868, 1977) were
originally isolated from activated supernatants or lysates from outdated platelets. Using the
techniques of subcellular fractionation and radioimmunoassay, β-thromboglobulin was identified
as an α-granule protein that could be released upon activation (Kaplan, et al., Blood 53: 604-
618, 1979). Platelet basic protein itself was later isolated from fresh platelets, megakaryocytes,
and HEL cells, an immortal human erythroleukemia cell line (Holt, et al., Biochemistry 25:
1988-1996, 1986; Holt, et al., Exp. Hematol. 16: 302-306, 1988). Walz and Baggiolini
isolated the processed form of NAP-2 from platelet-containing cultures of stimulated
mononuclear cells (Walz, et al., J. Exp. Med. 170: 1745-1750, 1989).

Material labeled as β-thromboglobulin is commercially available from Calbiochem, San
Diego, CA (Cat. # 605165), Celsus Laboratories, Cincinnati, OH (Cat. # 41705), and
Haematologic Technologies, Essex Jct., VT (Cat. # HBTG-02100). The inventors have
determined, by using the "Purification Assay," that the commercial preparations have heparanase
activity at a level of 0.075 units/µg. This activity is below the level of 1 unit/µg needed for the
screening of anti-heparanase compounds in accordance with the assay of the subject invention.

U.S. Patent 4,897,348 describes recombinant materials and methods for producing
human connective tissue-activating peptide-III (CTAP-III) and analogs thereof.

Transglutaminases catalyze the posttranslational modification of proteins by
forming ε-(γ-glutamyl)lysine cross-links (Greenberg, et al., FASEB J. [...]).
This posttranslational modification has been reported to dramatically alter the action of some
small proteins. For example, a transglutaminase produces a glutamine-lysine cross-link in the 13
kD phospholipase A2 and increases its specific enzymatic activity (Cordella-Miele, et al., J.
Biol. Chem. 265: 17180-17188, 1990). A transglutaminase cross-links another small molecule,
interleukin-2, and converts its activity to one that is cytotoxic to mature oligodendrocytes (Eitan
and Schwartz, Science 261: 106-108, 1993). The glutamine-lysine cross-link in a protein would
result in the loss of overall positive charge for that protein. The transglutaminases are optimally
active and generally used under reducing conditions, such as in the presence of dithiothreitol. The concept that
glutamine-lysine cross-linking alters the activity of these small proteins may be applicable to
other small molecules as well.
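
The charge argument above can be illustrated with a rough calculation. The sketch below is an illustrative assumption, not part of the disclosed method: it estimates net charge at neutral pH by counting each Lys or Arg as +1 and each Asp or Glu as -1, ignores histidine and the chain termini, and treats every glutamine-lysine cross-link as consuming one positively charged lysine amino group. The function names and the example peptide are hypothetical.

```python
def net_charge(seq: str) -> int:
    """Rough net charge at neutral pH: +1 per Lys/Arg, -1 per Asp/Glu."""
    positive = sum(seq.count(aa) for aa in "KR")
    negative = sum(seq.count(aa) for aa in "DE")
    return positive - negative


def net_charge_after_crosslinks(seq: str, n_crosslinks: int) -> int:
    """Each Gln-Lys cross-link is assumed to neutralize one lysine amine."""
    if n_crosslinks < 0 or n_crosslinks > seq.count("K"):
        raise ValueError("cross-link count must be between 0 and the number of Lys")
    return net_charge(seq) - n_crosslinks


# Hypothetical peptide in one-letter code (not a sequence from this document).
peptide = "GKKQELRD"
print(net_charge(peptide), net_charge_after_crosslinks(peptide, 1))
```

Under these simplifying assumptions, every cross-link lowers the net positive charge by one unit, which is the direction of change the paragraph describes.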

SUMMARY OF THE INVENTION 

The present invention discloses a method of screening for compounds having anti-heparanase
activity (AHA compounds), i.e. inhibitors of heparanase activity (IHA), comprising
the steps of: contacting a potential AHA compound with radiolabeled heparin/heparan sulfate
and heparanase for a time and under such conditions sufficient to allow for inhibition of
heparanase activity; detecting inhibition of heparanase activity; and selecting compounds that
inhibit heparanase activity. The present invention also discloses the amino acid sequence
identity of the heparanase that has been purified to homogeneity by chromatography under
reducing conditions. Identification of the amino acid sequence of the protein which contains
heparanase activity is crucial for the production of recombinant mammalian heparanase.
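
The detecting and selecting steps can be sketched numerically. The example below assumes, hypothetically, that each candidate is scored by the heparanase activity measured with the compound present, compared against a control without it, and that a compound is selected once inhibition reaches a chosen cutoff. The 50% cutoff, the function names, and the sample values are illustrative assumptions and do not come from the specification.

```python
def percent_inhibition(activity_with: float, activity_without: float) -> float:
    """Percent loss of heparanase activity relative to the no-compound control."""
    if activity_without <= 0:
        raise ValueError("control activity must be positive")
    return 100.0 * (1.0 - activity_with / activity_without)


def select_inhibitors(results: dict, control: float, cutoff: float = 50.0) -> list:
    """Return the names of compounds whose inhibition meets the (assumed) cutoff."""
    return [
        name
        for name, activity in results.items()
        if percent_inhibition(activity, control) >= cutoff
    ]


# Hypothetical measurements, in arbitrary activity units.
measured = {"compound_A": 25.0, "compound_B": 90.0}
print(select_inhibitors(measured, control=100.0))
```

Any real screen would also need replicate wells and controls for compounds that interfere with the readout itself; those are omitted here for brevity.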

DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides a purified heparanase, and a method for producing it.
The heparanase so produced has an activity of greater than 20 units/µg protein, preferably
greater than 50 units heparanase activity per µg protein (1 unit = 1% cpm < 30 K/hr using the
"Purification Assay" (Example 2, Part D)).

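The activity figures above can be turned into a small worked calculation. The sketch assumes one reading of the unit definition, namely that one unit corresponds to 1% of the total radiolabel (cpm) appearing in material smaller than 30 K per hour; that interpretation, the function names, and the sample counts are assumptions made for illustration only.

```python
def heparanase_units(cpm_below_30k: float, cpm_total: float, hours: float) -> float:
    """Units under the assumed definition: percent of cpm below 30 K, per hour."""
    if cpm_total <= 0 or hours <= 0:
        raise ValueError("total cpm and incubation time must be positive")
    return 100.0 * cpm_below_30k / cpm_total / hours


def specific_activity(units: float, micrograms_protein: float) -> float:
    """Specific activity in units per microgram of protein."""
    if micrograms_protein <= 0:
        raise ValueError("protein amount must be positive")
    return units / micrograms_protein


def exceeds(activity_units_per_ug: float, threshold: float = 20.0) -> bool:
    """True when the specific activity is greater than the stated threshold."""
    return activity_units_per_ug > threshold


# Hypothetical assay: 3000 of 10000 cpm shifted below 30 K in 1 hour, 0.5 ug protein.
units = heparanase_units(3000, 10000, 1.0)
activity = specific_activity(units, 0.5)
print(units, activity, exceeds(activity), exceeds(activity, 50.0))
```

For comparison, the 0.075 units/µg reported above for commercial preparations would fail both the 20 and the 50 units/µg thresholds under this calculation.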
In addition, the present invention provides recombinant heparanase and a means for
producing it. The term "purified heparanase" as used in the specification and claims includes
the recombinant heparanase as described in the subject application. The recombinant heparanase
of the subject invention can be used for the same purposes and in the same manner as the
purified heparanase.

The purified heparanase of the present invention has an isoelectric point of less than 5.5
(preferably about 4.8 - 5.1) and preferably is activated by treatment with transglutaminase using
reducing conditions.

[...] (preferably about [...]) and is isolated under reducing conditions and activated by
treatment with transglutaminase.

Suitable transglutaminases that may be used for this purpose include Activated Factor
XIIIa, guinea pig liver transglutaminase, epidermal transglutaminase, keratinocyte
transglutaminase, and tissue transglutaminase.

The heparanase of the present invention has the amino acid sequence (SEQ. ID. NO: 1)
of:

Asn Leu Ala Lys Gly Lys Glu Glu Ser Leu Asp Ser Asp Leu Tyr Ala
1               5                   10                  15

Glu Leu Arg Cys Met Cys Ile Lys Thr Thr Ser Gly Ile His Pro Lys
            20                  25                  30

Asn Ile Gln Ser Leu Glu Val Ile Gly Lys Gly Thr His Cys Asn Gln
        35                  40                  45

Val Glu Val Ile Ala Thr Leu Lys Asp Gly Arg Lys Ile Cys Leu Asp
    50                  55                  60

Pro Asp Ala Pro Arg Ile Lys Lys Ile Val Gln Lys Lys Leu Ala Gly
65                  70                  75                  80

Asp Glu Ser Ala Asp
                85

encoded by the cDNA sequence (SEQ ID NO: 2) of: 

1 AACTTGGCGA AAGGCAAAGA GGAAAGTCTA GACAGTGACT TGTATGCTGA
51 ACTCCGCTGC ATGTGTATAA AGACAACCTC TGGAATTCAT CCCAAAAACA
101 TCCAAAGTTT GGAAGTGATC GGGAAAGGAA CCCATTGCAA CCAAGTCGAA
151 GTGATAGCCA CACTGAAGGA TGGGAGGAAA ATCTGCCTGG ACCCAGATGC
201 TCCCAGAATC AAGAAAATTG TACAGAAAAA ATTGGCAGGT GATGAATCTG
251 CTGAT

which corresponds to the cDNA sequence and derived amino acid sequence of CTAP-III. See
Wenger et al., Blood, 73: 1498-1503, 1989.
In another aspect, the present invention provides a heparanase having the amino acid
sequence (SEQ ID NO: 3) of:

Ser Ser Thr Lys Gly Gln Thr Lys Arg Asn Leu Ala Lys Gly Lys Glu
1               5                   10                  15

Glu Ser Leu Asp Ser Asp Leu Tyr Ala Glu Leu Arg Cys Met Cys Ile
            20                  25                  30

Lys Thr Thr Ser Gly Ile His Pro Lys Asn Ile Gln Ser Leu Glu Val
        35                  40                  45

Ile Gly Lys Gly Thr His Cys Asn Gln Val Glu Val Ile Ala Thr Leu
    50                  55                  60

Lys Asp Gly Arg Lys Ile Cys Leu Asp Pro Asp Ala Pro Arg Ile Lys
65                  70                  75                  80

Lys Ile Val Gln Lys Lys Leu Ala Gly Asp Glu Ser Ala Asp
                85                  90

encoded by the cDNA sequence (SEQ ID NO: 4) of: 

1 TCCTCCACCA AAGGACAAAC TAAGAGAAAC TTGGCGAAAG GCAAAGAGGA 
51 AAGTCTAGAC AGTGACTTGT ATGCTGAACT CCGCTGCATG TGTATAAAGA 
101 CAACCTCTGG AATTCATCCC AAAAACATCC AAAGTTTGGA AGTGATCGGG
151 AAAGGAACCC ATTGCAACCA AGTCGAAGTG ATAGCCACAC TGAAGGATGG 
201 GAGGAAAATC TGCCTGGACC CAGATGCTCC CAGAATCAAG AAAATTGTAC 
251 AGAAAAAATT GGCAGGTGAT GAATCTGCTG AT 

which corresponds to the cDNA sequence and derived amino acid sequence of platelet basic 
protein. See Wenger et al., Blood, 73: 1498-1503, 1989 as well as Walz and Baggiolini, BBRC 
159: 969-981, 1989; Castor, et al , BBRC 163: 1071-1078,1989. 

In another aspect, the present invention provides a heparanase having the amino acid 
sequence (SEQ ID NO: 5) of: 

Gly Lys Glu Glu Ser Leu Asp Ser Asp Leu Tyr Ala Glu Leu Arg Cys
1               5                   10                  15

Met Cys Ile Lys Thr Thr Ser Gly Ile His Pro Lys Asn Ile Gln Ser
            20                  25                  30

Leu Glu Val Ile Gly Lys Gly Thr His Cys Asn Gln Val Glu Val Ile
        35                  40                  45

Ala Thr Leu Lys Asp Gly Arg Lys Ile Cys Leu Asp Pro Asp Ala Pro
    50                  55                  60

Arg Ile Lys Lys Ile Val Gln Lys Lys Leu Ala Gly Asp Glu Ser Ala
65                  70                  75                  80

Asp

encoded by the cDNA sequence (SEQ ID NO: 6) of: 

1 GGCAAAGAGG AAAGTCTAGA CAGTGACTTG TATGCTGAAC TCCGCTGCAT 
51 GTGTATAAAG ACAACCTCTG GAATTCATCC CAAAAACATC CAAAGTTTGG 
101 AAGTGATCGG GAAAGGAACC CATTGCAACC AAGTCGAAGT GATAGCCACA 
151 CTGAAGGATG GGAGGAAAAT CTGCCTGGAC CCAGATGCTC CCAGAATCAA 
201 GAAAATTGTA CAGAAAAAAT TGGCAGGTGA TGAATCTGCT GAT 



In another aspect, the present invention provides a heparanase having the amino acid
sequence (SEQ ID NO: 7) of:

Glu Leu Arg Cys Met Cys Ile Lys Thr Thr Ser Gly Ile His Pro Lys
1               5                   10                  15

Asn Ile Gln Ser Leu Glu Val Ile Gly Lys Gly Thr His Cys Asn Gln
            20                  25                  30

Val Glu Val Ile Ala Thr Leu Lys Asp Gly Arg Lys Ile Cys Leu Asp
        35                  40                  45

Pro Asp Ala Pro Arg Ile Lys Lys Ile Val Gln Lys Lys Leu Ala Gly
    50                  55                  60

Asp Glu Ser Ala Asp
65



encoded by the cDNA sequence (SEQ ID NO: 8) of: 

1 GAACTCCGCT GCATGTGTAT AAAGACAACC TCTGGAATTC ATCCCAAAAA 
51 CATCCAAAGT TTGGAAGTGA TCGGGAAAGG AACCCATTGC AACCAAGTCG 
101 AAGTGATAGC CACACTGAAG GATGGGAGGA AAATCTGCCT GGACCCAGAT 
151 GCTCCCAGAA TCAAGAAAAT TGTACAGAAA AAATTGGCAG GTGATGAATC 
201 TGCTGAT 

which corresponds to the cDNA sequence and derived amino acid sequence of neutrophil 
activating peptide-2. 
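
As a consistency check, the cDNA of SEQ ID NO: 8 can be translated with the standard genetic code and compared with the amino acid sequence of SEQ ID NO: 7. The sketch below is illustrative only; the helper name `translate` is hypothetical, and both sequences are transcribed from the listings above, with the protein written in one-letter code.

```python
from itertools import product

# Standard genetic code, codons ordered with T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cdna: str) -> str:
    """Translate a cDNA string (whitespace ignored) into one-letter amino acids."""
    cdna = "".join(cdna.split()).upper()
    if len(cdna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[cdna[i:i + 3]] for i in range(0, len(cdna), 3))


SEQ_ID_NO_8 = """
GAACTCCGCT GCATGTGTAT AAAGACAACC TCTGGAATTC ATCCCAAAAA
CATCCAAAGT TTGGAAGTGA TCGGGAAAGG AACCCATTGC AACCAAGTCG
AAGTGATAGC CACACTGAAG GATGGGAGGA AAATCTGCCT GGACCCAGAT
GCTCCCAGAA TCAAGAAAAT TGTACAGAAA AAATTGGCAG GTGATGAATC
TGCTGAT
"""

SEQ_ID_NO_7 = (
    "ELRCMCIKTTSGIHPK"
    "NIQSLEVIGKGTHCNQ"
    "VEVIATLKDGRKICLD"
    "PDAPRIKKIVQKKLAG"
    "DESAD"
)

protein = translate(SEQ_ID_NO_8)
print(protein == SEQ_ID_NO_7, len(protein))
```

The same helper can be applied to the other cDNA listings in this document, provided the open reading frame is first located, since several of them include untranslated flanking sequence.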

The foregoing amino acid sequences correspond to the products of a single gene called
platelet basic protein (Walz and Baggiolini, BBRC 159: 969-981, 1989; Castor, et al., BBRC
163: 1071-1078, 1989). The complete gene sequence of platelet basic protein is well known.
See, for example, Wenger et al., Blood, 73: 1498-1503, 1989 and Proc. Natl. Acad. Sci. USA,
90, 3660-3664, 1993.
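
The single-gene relationship can be checked directly: each of the shorter sequences is the C-terminal portion of the platelet basic protein sequence. The sketch below is illustrative; the variable names are hypothetical, and the one-letter sequences are transcribed from SEQ ID NOS: 1, 3, 5 and 7 (SEQ ID NO: 3 as encoded by the cDNA of SEQ ID NO: 4).

```python
# One-letter transcriptions of the listed sequences.
PBP = (  # SEQ ID NO: 3, platelet basic protein
    "SSTKGQTKRNLAKGKE" "ESLDSDLYAELRCMCI" "KTTSGIHPKNIQSLEV"
    "IGKGTHCNQVEVIATL" "KDGRKICLDPDAPRIK" "KIVQKKLAGDESAD"
)
CTAP_III = (  # SEQ ID NO: 1
    "NLAKGKEESLDSDLYA" "ELRCMCIKTTSGIHPK" "NIQSLEVIGKGTHCNQ"
    "VEVIATLKDGRKICLD" "PDAPRIKKIVQKKLAG" "DESAD"
)
SEQ_5 = (  # SEQ ID NO: 5
    "GKEESLDSDLYAELRC" "MCIKTTSGIHPKNIQS" "LEVIGKGTHCNQVEVI"
    "ATLKDGRKICLDPDAP" "RIKKIVQKKLAGDESA" "D"
)
NAP_2 = (  # SEQ ID NO: 7
    "ELRCMCIKTTSGIHPK" "NIQSLEVIGKGTHCNQ" "VEVIATLKDGRKICLD"
    "PDAPRIKKIVQKKLAG" "DESAD"
)


def n_terminal_offset(full: str, fragment: str) -> int:
    """Number of residues removed from the N-terminus, or -1 if not a C-terminal part."""
    return len(full) - len(fragment) if full.endswith(fragment) else -1


for name, seq in [("SEQ ID NO: 1", CTAP_III), ("SEQ ID NO: 5", SEQ_5), ("SEQ ID NO: 7", NAP_2)]:
    print(name, len(seq), n_terminal_offset(PBP, seq))
```

Each offset reported by the loop is the number of N-terminal residues by which that sequence is shorter than SEQ ID NO: 3, which is consistent with proteolytic processing of one precursor.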

The present invention also provides heparanase having the amino acid sequences of
other members of the CXC chemokine family [including Platelet factor 4 (SEQ. ID NO. 12),
γIP-10 (SEQ. ID NO. 14), gro/MGSA (SEQ. ID NO. 16), gro-β/MIP-2α (SEQ. ID NO. 18),
gro-γ/MIP-2β (SEQ. ID NO. 20), Interleukin-8/NAP-1 (SEQ. ID NO. 22) and ENA-78 (SEQ. ID
NO. 24)] as well as members of the CC chemokine family [including MIP-1α (SEQ. ID NO.
26), MIP-1β (SEQ. ID NO. 28), I-309 (SEQ. ID NO. 30), MCP-1 (SEQ. ID NO. 32), MCP-3
(SEQ. ID NO. 34), RANTES (SEQ. ID NO. 36), fic (SEQ. ID NO. 38) and MCP-2 (SEQ. ID
NO. 40)]; purified to apparent homogeneity, prepared in the presence of reducing [...]

[...] Suitable transglutaminases that may be used for
this purpose include Activated Factor XIIIa, guinea pig liver transglutaminase, epidermal
transglutaminase, keratinocyte transglutaminase, and tissue transglutaminase.

In another aspect, the present invention provides a heparanase having the amino acid
sequence (SEQ ID NO: 12) of:

Met Ser Ser Ala Ala Gly Phe Cys Ala Ser Arg Pro Gly Leu Leu Phe Leu Gly Leu Leu
Leu Leu Pro Leu Val Val Ala Phe Ala Ser Ala Glu Ala Glu Glu Asp Gly Asp Leu Gln
Cys Leu Cys Val Lys Thr Thr Ser Gln Val Arg Pro Arg His Ile Thr Ser Leu Glu Val
Ile Lys Ala Gly Pro His Cys Pro Thr Ala Gln Leu Ile Ala Thr Leu Lys Asn Gly Arg
Lys Ile Cys Leu Asp Leu Gln Ala Pro Leu Tyr Lys Lys Ile Ile Lys Lys Leu Leu Glu
Ser



encoded by the cDNA sequence (SEQ ID NO: 13) of: 

1 CCGCAGCATG AGCTCCGCAG CCGGGTTCTG CGCCTCACGC CCCGGGCTGC 
51 TGTTCCTGGG GTTGCTGCTC CTGCCACTTG TGGTCGCCTT CGCCAGCGCT
101 GAAGCTGAAG AAGATGGGGA CCTGCAGTGC CTGTGTGTGA AGACCACCTC 
151 CCAGGTCCGT CCCAGGCACA TCACCAGCCT GGAGGTGATC AAGGCCGGAC 
201 CCCACTGCCC CACTGCCCAA CTGATAGCCA CGCTGAAGAA TGGAAGGAAA 
251 ATTTGCTTGG ACCTGCAAGC CCCGCTGTAC AAGAAAATAA TTAAGAAACT 
301 TTTGGAGAGT TAGCTACTAG CTGCCTACGT GTGTGCATTT GCTATATAGC 
351 ATACTTCTTT TTTCCAGTTT CAATCTAACT GTGAAAGAAA CTTCTGATAT 
401 TTGTGTTATC CTTATGATTT TAAATAAACA AAATAAATC 

which corresponds to the cDNA sequence and derived amino acid sequence of platelet factor 4. 

See Poncz et al., Blood 69, 219-223 (1987). 

In another aspect, the present invention provides a heparanase having the amino acid 
sequence (SEQ ID NO: 14) of: 

Met Asn Gln Thr Ala Ile Leu Ile Cys Cys Leu Ile Phe Leu Thr Leu Ser Gly Ile Gln Gly
Val Pro Leu Ser Arg Thr Val Arg Cys Thr Cys Ile Ser Ile Ser Asn Gln Pro Val Asn Pro
Val Asn Pro Arg Ser Leu Glu Lys Leu Glu Ile Ile Pro Ala Ser Gln Phe Cys Pro Arg
Val Glu Ile Ile Ala Thr Met Lys Lys Lys Gly Glu Lys Arg Cys Leu Asn Pro Glu Ser
Lys Ala Ile Lys Asn Leu Leu Lys Ala Val Ser Lys Glu Met Ser Lys Arg Ser Pro
encoded by the cDNA sequence (SEQ ID NO: 15) of: 

1 GAGACATTCC TCAATTGCTT AGACATATTC TGAGCCTACA GCAGAGGAAC 
51 CTCCAGTCTC AGCACCATGA ATCAAACTGC GATTCTGATT TGCTGCCTTA 
101 TCTTTCTGAC TCTAAGTGGC ATTCAAGGAG TACCTCTCTC TAGAACCGTA 
151 CGCTGTACCT GCATCAGCAT TAGTAATCAA CCTGTTAATC CAAGGTCTTT 



[...]
301 TCGAAGGCCA TCAAGAATTT ACTGAAAGCA GTTAGCAAGG AAATGTCTAA
351 AAGATCTCCT TAAAACCAGA GGGGAGCAAA ATCGATGCAG TGCTTCCAAG
401 GATGGACCAC ACAGAGGCTG CCTCTCCCAT CACTTCCCTA CATGGAGTAT
451 ATGTCAAGCC ATAATTGTTC TTAGTTTGCA GTTACACTAA AAGGTGACCA
501 ATGATGGTCA CCAAATCAGC TGCTACTACT CCTGTAGGAA GGTTAATGTT
551 CATCATCCTA AGCTATTCAG TAATAACTCT ACCCTGGCAC TATAATGTAA
601 GCTCTACTGA GGTGCTATGT TCTTAGTGGA TGTTCTGA r C CTGCTTCAAA
which corresponds to the cDNA sequence and derived amino acid sequence of γIP-10. See Luster



In another aspect, the present invention provides a heparanase having the amino acid 
sequence (SEQ ID NO: 16) of: 

Met Ala Arg Ala Ala Leu Ser Ala Ala Pro Ser Asn Pro Arg Leu Leu Arg Val Ala Leu
Leu Leu Leu Leu Leu Val Ala Ala Gly Arg Arg Ala Ala Gly Ala Ser Val Ala Thr Glu
Leu Arg Cys Gln Cys Leu Gln Thr Leu Gln Gly Ile His Pro Lys Asn Ile Gln Ser Val
Asn Val Lys Ser Pro Gly Pro His Cys Ala Gln Thr Glu Val Ile Ala Thr Leu Lys Asn
Gly Arg Lys Ala Cys Leu Asn Pro Ala Ser Pro Ile Val Lys Lys Ile Ile Glu Lys Met
Leu Asn Ser Asp Lys Ser Asn
encoded by the cDNA sequence (SEQ ID NO: 17) of: 

1 CTCGCCAGCT CTTCCGCTCC TCTCACAGCC GCCAGACCCG CCTGCTGAGC 
51 CCCATGGCCC GCGCTGCTCT CTCCGCCGCC CCCAGCAATC CCCGGCTCCT 
101 GCGAGTGGCA CTGCTGCTCC TGCTCCTGGT AGCCGCTGGC CGGCGCGCAG 
151 CAGGAGCGTC CGTGGCCACT GAACTGCGCT GCCAGTGCTT GCAGACCCTG
201 CAGGGAATTC ACCCCAAGAA CATCCAAAGT GTGAACGTGA AGTCCCCCGG 
251 ACCCCACTGC GCCCAAACCG AAGTCATAGC CACACTCAAG AATGGGCGGA 
301 AAGCTTGCCT CAATCCTGCA TCCCCCATAG TTAAGAAAAT CATCGAAAAG 
351 ATGCTGAACA GTGACAAATC CAACTGACCA GAAGGGAGGA GGAAGCTCAC 
401 TGGTGGCTGT TCCTGAAGGA GGCCCTGCCC TTATAGGAAC AGAAGAGGAA 
451 AGAGAGACAC AGCTGCAGAG GCCACCTGGA TTGTGCCTAA TGTGTTTGAG 
501 CATCGCTTAG GAGAAGTCTT CTATTTATTT ATTTATTCAT TAGTTTTGAA 
551 GATTCTATGT TAATATTTTA GGTGTAAAAT AATTAAGGGT ATGATTAACT 
601 CTACCTGCAC ACTGTCCTAT TATATTCATT CTTTTTGAAA TGTCAACCCC 
651 AAGTTAGTTC AATCTGGATT CATATTTAAT TTGAAGGTAG AATGTTr ~ \ 
701 AATGTTCTCC AGTCATTATG TTAATATTTC TGAGGAGCCT GCAACATGCC 
751 AGCCACTGTG ATAGAGGCTG GCGGATCCAA GCAAATGGCC AATGAGATCA 



riTCTCATGT TGAAACTTTA AGAACTAAAA TGTTCTAAAT ATCCCTTGGA 
951 CATTTTATGT CTTTCTTGTA AGGCATACTG CCTTGTTTAA TGGTAGTTTT 
1001 ACAGTGTTTC TGGCTTAGAA CAAAGGGGCT TAATTATTGA TGTTTTCGGA 
which corresponds to the cDNA sequence and derived amino acid sequence of gro/MGSA
(melanoma growth stimulatory activity). See Anisowicz et al., Proc. Natl. Acad. Sci. U.S.A. 
84,7188-7192 (1987). 

In another aspect, the present invention provides a heparanase having the amino acid 
sequence (SEQ ID NO: 18) of:

Met Ala Arg Ala Thr Leu Ser Ala Ala Pro Ser Asn Pro Arg Leu Leu Arg Val Ala Leu
Leu Leu Leu Leu Leu Val Ala Ala Ser Arg Arg Ala Ala Gly Ala Pro Lys Ala Thr Glu
Lys Arg Cys Gln Cys Lys Gln Thr Leu Gln Gly Ile His Leu Lys Asn Ile Gln Ser Val
Lys Val Lys Ser Pro Gly Pro His Cys Ala Gln Thr Glu Val Ile Ala Thr Leu Lys Asn
Gly Gln Lys Ala Cys Leu Asn Pro Ala Ser Pro Met Val Lys Lys Ile Ile Glu Lys Met
Leu Lys

encoded by the cDNA sequence (SEQ ID NO: 19) of: 

1 CTCTCCTCCT CGCACAGCCG CTCGAACCGC CTGCTGAGCC CCATGGCCCG 
51 CGCCACGCTC TCCGCCGCCC CCAGCAATCC CCGGCTCCTG CGGGTGGCGC 
101 TGCTGCTCCT GCTCCTGGTG GCCGCCAGCC GGCGCGCAGC AGGAGCGCCC 
151 CTGGCCACTG AACTGCGCTG CCAGTGCTTG CAGACCCTGC AGGGAATTCA 
201 CCTCAAGAAC ATCCAAAGTG TGAAGGTGAA GTCCCCCGGA CCCCACTGCG 
251 CCCAAACCGA AGTCATAGCC ACACTCAAGA ATGGGCAGAA AGCTTGTCTC 
301 AACCCCGCAT CGCCCATGGT TAAGAAAATC ATCGAAAAGA TGCTGAAAAA 
351 TGGCAAATCC AACTGACCAG AAGGAAGGAG GAAGCTTATT GGTGGCTGTT 
401 CCTGAAGGAG GCCCTGCCCT TACAGGAACA GAAGAGGAAA GAGAGACACA 
451 GCTGCAGAGG CCACCTGGAT TGCGCCTAAT GTGTTTGAGC ATCACTTAGG 
501 AGAAGTCTTC TATTTATTTA TTTATTTATT TATTTGTTTG TTTTAGAAGA 
551 TTCTATGTTA ATATTTTATG TGTAAAATAA GGTTATGATT GAATCTACTT 
601 GCACACTCTC CCATTATATT TATTGTTTAT TTTAGGTCAA ACCCAAGTTA 
651 GTTCAATCCT GATTCATATT TAATTTGAAG ATAGAAGGTT TGCAGATATT 
701 CTCTAGTCAT TTGTTAATAT TTCTTCGTGA TGACATATCA CATGTCAGCC 
751 ACTGTGATAG AGGCTGAGGA ATCCAAGAAA ATGGCCAGTG AGATCAATGT 
801 GACGGCAGGG AAATGTATGT GTGTCTATTT TGTAACTGTA AAGATGAATG 
851 TCAGTTGTTA TTTATTGAAA TGATTTCACA GTGTGTGGTC AACATTTCTC 
901 ATGTTGAAGC TTTAAGAACT AAAATGTTCT AAATATCCCT TGGACATTTT 



[...] AAAGAACAG GAAAATAAAA TATTTAAAAA [...]
which corresponds to the cDNA sequence and derived amino acid sequence of gro-β/MIP-2α
(macrophage inflammatory protein 2-α). See Tekamp-Olson et al., J. Exp. Med. 172, 911-919
(1990).

In another aspect, the present invention provides a heparanase having the amino acid 
sequence (SEQ ID NO: 20) of: 

Met Ala His Ala Thr Leu Ser Ala Ala Pro Ser Asn Pro Arg Leu Leu Arg Val Ala Leu
Leu Leu Leu Leu Leu Val Ala Ala Ser Arg Arg Ala Ala Gly Ala Ser Val Val Thr Glu
Leu Arg Cys Gln Cys Leu Gln Thr Leu Gln Gly Ile His Leu Lys Asn Ile Gln Ser Val
Asn Val Arg Ser Pro Gly Pro His Cys Ala Gln Thr Glu Val Ile Ala Thr Leu Lys Asn
Gly Lys Lys Ala Cys Leu Asn Pro Ala Ser Pro Met Val Gln Lys Ile Ile Glu Lys Ile Leu
Asn Lys Gly Ser Thr Asn
encoded by the cDNA sequence (SEQ ID NO: 21) of:

1 CTCGCACAGC TTCCCGACGC GTCTGCTGAG CCCCATGGCC CACGCCACGC 
51 TCTCCGCCGC CCCCAGCAAT CCCCGGCTCC TGCGGGTGGC GCTGCTGCTC
101 CTGCTCCTGG TGGCCGCCAG CCGGCGCGCA GCAGGAGCGT CCGTGGTCAC 
151 TGAACTGCGC TGCCAGTGCT TGCAGACACT GCAGGGAATT CACCTCAAGA 
201 ACATCCAAAG TGTGAATGTA AGGTCCCCCG GACCCCACTG CGCCCAAACC
251 GAAGTCATAG CCACACTCAA GAATGGGAAG AAAGCTTGTC TCAACCCCGC
301 ATCCCCCATG GTTCAGAAAA TCATCGAAAA GATACTGAAC AAGGGGAGCA 
351 CCAACTGACA GGAGAGAAGT AAGAAGCTTA TCAGCGTATC ATTGACACTT 
401 CCTGCAGGGT GGTCCCTGCC CTTACCAGAG CTGAAAATGA AAAAGAGAAC 
451 AGCAGCTTTC TAGGGACAGC TGGAAAGGAC TTAATGTGTT TGACTATTTC
501 TTACGAGGGT TCTACTTATT TATGTATTTA TTTTTGAAAG CTTGTATTTT 
551 AATATTTTAC ATGCTGTTAT TTAAAGATGT GAGTGTGTTT CATCAAACAT 
601 AGCTCAGTCC TGATTATTTA ATTGGAATAT GATGGGTTTT AAATGTGTCA 
651 TTAAACTAAT ATTTAGTGGG AGACCATAAT GTGTCAGCCA CCTTGATAAA 
701 TGACAGGGTG GGGAACTGGA GGGTGGGGGG ATTGAAATGC AAGCAATTAG
751 TGGATCACTG TTAGGGTAAG GGAATGTATG TACACATCTA TTTTTTATAC 
801 TTTTTTTTTA AAAAAAGAAT GTCAGTTGTT ATTTATTCAA ATTATCTCAC 
851 ATTATGTGTT CAACATTTTT ATGCTGAAGT TTCCCTTAGA CATTTTATGT 
901 CTTGCTTGTA GGGCATAATG CCTTGTTTAA TGTCCATTCT GCAGCGTTTC 
951 TCTTTCCCTT GGAAAAGAGA ATTTATCATT ACTGTTAC

which corresponds to the cDNA sequence and derived amino acid sequence of gro-γ/MIP-2β.

In another aspect, the present invention provides a heparanase having the amino acid
sequence (SEQ ID NO: 22) of:

Met Thr Ser Lys Leu Ala Val Ala Leu Leu Ala Ala Phe Leu Ile Ser Ala Ala Leu Cys
Glu Gly Ala Val Leu Pro Arg Ser Ala Lys Glu Leu Arg Cys Gln Cys Ile Lys Thr Tyr
Ser Lys Pro Phe His Pro Lys Phe Ile Lys Glu Leu Arg Val Ile Glu Ser Gly Pro His Cys
Ala Asn Thr Glu Ile Ile Val Lys Leu Ser Asp Gly Arg Glu Leu Cys Leu Asp Pro Lys
Glu Asn Trp Val Gln Arg Val Val Glu Lys Phe Leu Lys Arg Ala Glu
encoded by the cDNA sequence (SEQ ID NO: 23) of: 

1 ATGACTTCCA AGCTGGCCGT GGCTCTCTTG GCAGCCTTCC TGATTTCTGC
51 AGCTCTGTGT GAAGGTGCAG TTTTGCCAAG GAGTGCTAAA GAACTTAGAT 
101 GTCAGTGCAT AAAGACATAC TCCAAACCTT TCCACCCCAA ATTTATCAAA 
151 GAACTGAGAG TGATTGAGAG TGGACCACAC TGCGCCAACA CAGAAATTAT 
201 TGTAAAGCTT TCTGATGGAA GAGAGCTCTG TCTGGACCCC AAGGAAAACT
251 GGGTGCAGAG GGTTGTGGAG AAGTTTTTGA AGAGGGCTGA G 
which corresponds to the cDNA sequence and derived amino acid sequence Interleukin-8/NAP-l 
(neutrophil activating protein-1). See Kunser et al., Kidney Int. 39, 1240-1248 (1991). 

In another aspect, the present invention provides a heparanase having the amino acid 
sequence (SEQ ID NO: 24) of: 

Ala Gly Pro Ala Ala Ala Val Leu Arg Glu Lys Arg Cys Val Cys Leu Gln Thr Thr Gln
Gly Val His Pro Lys Met Ile Ser Asn Leu Gln Val Phe Ala Ile Gly Pro Gln Cys Ser
Lys Val Glu Val Val Ala Ser Leu Lys Asn Gly Lys Glu Ile Cys Leu Asp Pro Glu Ala
Pro Phe Leu Lys Lys Val Ile Gln Lys Ile Leu Asp Gly Gly Asn Lys Glu Asn
encoded by the cDNA sequence (SEQ ID NO: 25) of: 

1 GTGTTGCGGG AACTGCGGTG CGTGTGTTTA CAGACCACGC AGGGAGTTCA 
51 TCCCAAAATG ATCAGTAATC TGCAAGTGTT CGCCATAGGC CCACAGTGCT
101 CCAAGGTGGA AGTGGTAGCC TCCCTGAAGA ACGGGAAGGA AATTTGTCTT 
151 GATCCAGAAG CCCCTTTTCT AAAGAAAGTC ATCCAGAAAA TCCTCGACGG 
201 CGGCAACAAA GAAAAC 
which corresponds to the cDNA sequence and derived amino acid sequence of a novel 
inflammatory peptide (ENA-78) with homology to interleukin 8. See Walz et al., J. Exp. Med. 
174, 1355-1362 (1991). 

In another aspect, the present invention provides a heparanase having the amino acid 
sequence (SEQ ID NO: 26) of: 

Met Gln Val Ser Thr Ala Ala Leu Ala Val Leu Leu Cys Thr Met Ala Leu Cys Asn Gln
[...]
[...] Ser Val Ile Phe Leu Thr Lys Arg Gly Arg Gln Val Lys Ala Asp Pro Ser Glu Glu
Trp Val Gln Lys Tyr Val Ser Asp Leu Glu Leu Ser Ala
encoded by the cDNA sequence (SEQ ID NO: 27) of:
1 GAATTCAAGG CCTGTCCTGG TTTGGTCCCA ATTTACCTTT ATCATCCATA
51 TTCACCCCCA CTGCTCTGCA GCTCCACTGA AGCACCCCCT CTTTCCTCTG 
101 AGCCACAATG TCACACCCAG GACTCTGCCT CAGCTGGGCC TCCACTGCCC 
151 ACCCATCTAT AGATGCCTAA ATCCCGGGCA GTTATCCAGA CACAACTAAA 
201 GTTCCATCCC TTCCATGAAG CCTTCCCCAA CCCTCTGGTG GAAGGTCACT 
251 TCTTCCTCAT nnnnrrrrnA nrrrrr+TH* : ; : ; ; CTAC 7AACAC77T a 
301 ACAATTACCT GTTCATACAC TCTACCTGCC CCCATGAGAC CAGGGGCATC 
351 TCAGAAACAA AGATCATTAA AACCAACTAA ATCTATTTCT CATTATAAAA 
401 TGAGATATGC TGATTGATTG CAAAATAATA AAATAACAAA GTATGGAAAA 
451 GAAAAAAAAA AGCATATAAT CTGGCTGAGA AGGTAGAGAC CCTTCCACAC 
501 CACTGAAATT ATGTGTTGAA AAGAATAAGG AAAAAACTGC TTCAGTTTGG 
551 CATTATTTAT GTAAGTATAG TATAGGATCC TTAAAATGGT TCAAAGAAAT 
601 GGGAAATCAA GACTTCATTT TGGCAAAGCC ATTGAACAGA AACTGTAGCA 
651 TATTTATCAG TAATTTCTTT CAGATTAAAC AACTGACAAC AACCCACTTT
701 TCAACCAGTG ATGTTGGAAA TGTTTTAAAA CAAAATTAGT TCATAAATTT 
751 GTGGGTTGAC CAAGAAGGTA ATAAAGTCTC ACTAAATAAA ATGAGGAAAA 
801 TTCAGAAAAA GAAAAAAATA AGAAAATAAA TCACCCATGG ATCTAAGCAC 
851 TATTCATTCT TTAAGGCATG TATTTCCAAG CCTTTTAATT TTTTCATGCC 
901 TAGAGTTGGC ATGGCATATA TATATCTTTA TACAATTCTT CAAATTTTAT 
951 AGAATTTGTA TAATGTTTTA TCTTGCTTTT TTTTTAACCA CTGATGTTAT 
1001 AAGCATATTT ATGCCACTTC ATTCACGTTA GAGACTTAAT AATAAAGGAT 
1051 CTTGTGGATA ATTTATCATT CCCTGATAGA GAAAAATTTA GCTTTGCTTA 
1101 TTTTAGAGTT ATAAATGATG CTGGGTCAGG TATCTTTATG TTTGAAGATG 
1151 GCTCCATATT TGGGTTGTTT CCACAGAACT CTTTCCAGAA ATGCTTTTTC 
1201 TAGGTTAATG GCTACACATA TTTCTAGGCA CCTGACATAC TGACACCCAC 
1251 CTCTAAAGTA TTTTTATGAT CCACAACTAG CGTTTAACAC AGCGCCCCAG 
1301 TCACTCCGAG ACTAATAAAT AGACAAATGA CTGAAACGTG ACCTCATGCT 
1351 TTCTATTCCT CCAGCTTTCA TTGAGTTCCT TTCCTCTGGG AGGACTGGGG 
1401 GTTGTCTAGC CCTCCACAGC ATCAGCCCAT TGACCCTATC CTTGTGGTTA 
1451 TAGCAGCTGA GGAAGCAGAA TT AGCTCT GTGGGAAGGA ATGGGGCTGG 
1501 AGAGTTCATG CATAGACCAA TTCiTTTTTT HUllllTl ' TGAGATGGAG 



^ : GAGTAGCTGG GATTACAGGC A1GTGCCACC ACGCCTGACT ACTTTTGTAT 
1701 TTTTAGTAGA GATGGAGTTT CTCTTTCTTG GTCAGGTTGG TCTCAAACTC 
1751 CTGACCTCAG GTGATCTGCA GCCTCGGCCT CCAAAGTGTT GGGATTACAG

WO 95/04158 PCT/US94/08207

1801 GTGTGAGCGA CCATGCCTGG CTGCATAGAC CAGTTCTTAT GAGAAGGGAT 
1851 CAACTAAGAA TAGCCTTGGG TTGACACACA CCCCTCTTCA CACTCACAGG 
1901 AGAAACCCCA TGAAGCTAGA ACCAGTCATG AGTTGAGAGC TGAGAGTTAG 
1951 AGAGTAGCTC AGAGATGCTA TTCTTGGATA TCCTGAGCCC CTGTGGTCAC 
2001 CAGGGACCCT GAGTTGTGCA ACACTCAGCA TGACAGCATC ACTACACTTA 

9n*\1 AAA A TTTPPr TPTTr" a r^r^r^r^ * > t^^^ * ■ — — — — — - - , - . , 

2101 CCTATAAAGA GGAGAGATGG CTTCAGACAT CAGAAGGACG CAGGCAGCAA 

2151 AGAGTAGTCA GTCCCTTCTT GGCTCTGCTG ACACTCGAGC CCACATTCCA 

2201 TCACCTGCTC CCAATCATGC AGGTCTCCAC TGCTGCCCTT GCCGTCCTCC 

2251 TCTGCACCAT GGCTCTCTGC AACCAGGTCC TrTCTGCACC ACGTGAGTCC 

2301 ATGTTGTTGT TGTGGGTATC ACCACTCTCT GGCCATGGTT AGACCACATC 

2351 AGTcnrnr TGTGGCGTGA GAGGCCCCGA agagaaaaga aggaagttct 
2401 taaagcgctg ccaaacacct tggtcttttt cttcacaact tttattttta
2451 tctctagaag gggtcttagc cctcctagtc tccaggtatg agaatctagg
2501 caggggcagg ggagttacag tcccttgtac agatagaaaa acagggttca 
2551 aaacgaatca gtttgcaaga ggcagaatcc agggctgctt acttcccagt 
2601 ggggtctgtt cttcactctc cagctcaccc tagtctccca ggagccctgt 
2651 cccttggatg tcttatgaga gatgtccagg gcttctcttg ggctggggta 

2701 TGACTTCTTG AACCGACAAA ATTCCATGAA GAGAGCTAAG AGAACAGTCC 
2751 ATTCAGGTAT CTGGATCACA TAGAGAAACA GAGAACCCAC TATGAAGAGT 
2801 CAAGGGGAAA GAGGAATATA GACAGAAACA AAGAGACATT TCTCTGCAAA 
2851 ACCCCCCAAA TGCCTTGCAG TCACTTGGTC TGAGCAAGCC TGCCCTCCTC
2901 AACCACTCAG GGATCAGAAG CTGCCTGGCC TTTTCTTCTG AGCTGTGACT
2951 TGGGCTTATT CTCTCCTTTC TCCGCAGTTG CTGCTGACAC GCCGACCGCC
3001 TGCTGCTTCA GCTACACCTC CCGACAGATT CCACAGAATT TCATAGCTGA 
3051 CTACTTTGAG ACGAGCAGCC AGTGCTCCAA GCCCAGTGTC ATGTAAGTGC 
3101 CAGTCTTCCT GCTCACCTCT AGGGAGGTAG GGAGTGTCAG GGTGGGGGCA 
3151 GAAACAGGCC AGAAGGCCAT CCTGGAAAGG CCCAGCCTTC AGGAGCCTAT
3201 CGGGGATACA GGACGCAGGG CACTGAGGTG TGACCTGACT TGGGGCTGGA 
3251 GTGAGGTGGG TGTTACAGAG TCAGGAAGGG CTGCCCCAGG CCAGAGGAAA 
3301 GGGACAGGAA GAAGGAGGCA GCAGGACACT CTGAGGGCCC CCTTGCCTGG 

-■1--T- , , ■ „ T ■ ■ 

■■ T ( 1 * J| ! : i. ! .. 

..■ . ... . \ \. I W <. J V V . 

^ . TGGTGGGCCC AGGATTCCCC GGCTGGATTC CCCAGTGCIT AACTCTTCCT 
3501 CCCTTCTCCA CAGCTTCCTA ACCAAGAGAG GCCGGCAGGT CTGTGCTGAC 
3551 CCCAGTGAGG AGTGGGTCCA GAAATACGTC AGTGACCTGG AGCTGAGTGC
3601 CTGAGGGGTC CAGAAGCTTC GAGGCCCAGC GACCTCAGTG GGCCCAGTGG
3651 GGAGGAGCAG GAGCCTGAGC CTTGGGAACA TGCGTGTGAC CTCCACAGCT 
370. ArCTCITCTA TGGACTGGTT ATTGCCAAAC AGCCACACTG TGGGACTCTT 
3751 CTTAACTTAA ATTTTAATTT ATTTATACTA TTTAGTTTTT ATAATTTATT
3801 TTTGATTTCA CAGTGTGTTT GTGATTGTTT GCTCTG GAG TTCCCCCTGT
385 1 CCCCTCCACC TTCCCTCACA GTGTGTCTGG TCtACA a rrr. a . 
3901 TCGGCCTGTG TAGGCAGTCA TGGCACCAAA GCCACCAGAC TGACAAATGT 
3951 GTATCAGATG CTTTTGTTCA GGGCTGTGAT CGGCCTGGG' AAATAATAAA 
4001 GATGTTCTTT TAAACGGTAA ACCAGTATTG AGTTTGGTTT TGTTTTTCTG 
4051 GCAAATCA AA .TCACTGGTT AAGAGGAATC ATAGGCAAAG ATTAGGA at.a 
4101 GGTGAAATGG AGGGAAATTG GGAGAGATGG GGAGCGCTGC GACAGAGTTA 
4151 TCCACITCAC AAAATTCTGG AACATTGAAA CTACGAATAT GTTATAACTr 
4201 AAATCGTAAT ATGCACGCTC TAGGAGAATT AACTACTTGA ATGGCCAQ 
4251 TTAAGCAGAG TATTCTGTAG GGCATATTCA TGATGAATCA AGCTCTTAAT 
4301 AGCAATTATT TACATTGTTG AGGCTTACTC CTCCTACTGA GTGC7TTTTA 
4351 TACATTGTTC ATTTAATCTT ACCAATGCAA .AGTACAGCT TAGGTACTAT 
4401 TAATACCTCC ACTTGACAGA AAAGTAACCC AGGGCTCAGA AAGGTTAGAC 
4451 AACTTGGCTG AGGTTACACA GCACGTAAAC GGTCAATTGT GTTCCAAAAC 
4501 TGGACTTTTA TTGAACTACA GACTATGCTG TTAACCATTG ACCAAGTTAT 
4551 TTCCCAAAGT ATGACCCGCC TATAGTCAAA TCTTACCCCA TTCTTTAACA 
4601 GATGATACTT TATCCATTGC AACCACTTCC TGTCAGGATT CTGAGTTGAC 
4651 ATAGAGTGTT TCAGCAGTGA TTATTTAAGC CAATTACATC AGGATCTTTA 
4701 GGTGTAGACC TGGGAACTGA TATTTTTATC AAGCTCATGA GGTGTTCCAT 
4751 AGCATGTTAA TGACTGAGAG CCACTGTCAA TAGAATTC 
which corresponds to the cDNA sequence and derived amino acid sequence of MIP-1α
(macrophage inflammatory protein 1-α). See Blum et al., DNA Cell Biol. 9, 589-602 (1990).

In another aspect, the present invention provides a heparanase having the amino acid
sequence (SEQ ID NO: 28) of:

Met Lys Leu Cys Val Thr Val Leu Ser Leu Leu Met Leu Val Ala Ala Phe Cys Ser Pro
Ala Leu Ser Ala Pro Met Gly Ser Asp Pro Pro Thr Ala Cys Cys Phe Ser Tyr Thr Ala
Arg Lys Leu Pro Arg Asn Phe Val Val Asp Tyr Tyr Glu Thr Ser Ser Leu Cys Ser Gln

Pr-r. Alo V .' p| ^ 

encoded by the cDNA sequence (SEQ ID NO: 29) of:

1 TTCCCCCCCC CCCCCCCCCC CCCCGCCCGA GCACAGGACA CAGCTGGGTT 
51 CTGAAGCTTC TGAGTTCTGC AGCCTCACCT CTGAGAAAAC nrrrmTA 
101 CCAATACCAT GAAGCTCTGC GTGACTGTCC TGTCTCTCCT CATGCTAGTA
151 GCTGCCTTCT GCTCTCCAGC GCTCTCAGCA CCAATGGGCT CAGACCCTCC
201 CACCGCCTGC TGCTTTTCTT ACACCGCGAG GAAGCTTCCT CGCAACTTTG
251 TGGTAGATTA CTATGAGACC AGCAGCCTCT GCTCCCAGCC AGCTGTGGTA
301 TTCCAAACCA AAAGAAGCAA GCAAGTCTGT GCTGATCCCA GTGAATCCTG
351 GGTCCAGGAG TACGTGTATO ACCTGaA apt n.A Arrr.Ar.rT ^^r.v^.-VTAC 
401 AGGAAGTCTT CAGGGAAGGT CACCTGAGCC CGGATGCTTC TCCATGAGAC 
451 ACATCTCCTC CATACTCAGG ACTCCTCTCC GCAGTTCCTG TCCCTTCTCT 
501 TAATTTAATC TTTTTTATGT GCCGTGTTAT TGTATTAGGT GTCATTTCCA 
551 TTATTTATAT TAGTTTAGCC AAAGGATAAG TGTCCTATGG GGATGGTCCA
601 CTGTCACTGT TTCTCTGCTG TTGCAAATAC ATGGATAACA CATTTGATTC 
651 TGTGTGTTTT CCATAATAAA ACTTTAAAAT AAAATGCAGA CAGTTA 
which corresponds to the cDNA sequence and derived amino acid sequence of MIP-1β
(macrophage inflammatory protein 1-β). See Lipes et al., Proc. Natl. Acad. Sci. U.S.A. 85,
9704-9708 (1988).

In another aspect, the present invention provides a heparanase having the amino acid 
sequence (SEQ ID NO: 30) of: 

Met Gln Ile Ile Thr Thr Ala Leu Val Cys Leu Leu Leu Ala Gly Met Trp Pro Glu Asp
Val Asp Ser Lys Ser Met Gln Val Pro Phe Ser Arg Cys Cys Phe Ser Phe Ala Glu Gln
Glu Ile Pro Leu Arg Ala Ile Leu Cys Tyr Arg Asn Thr Ser Ser Ile Cys Ser Asn Glu
Gly Leu Ile Phe Lys Leu Lys Arg Gly Lys Glu Ala Cys Ala Leu Asp Thr Val Gly Trp
Val Gln Arg His Arg Lys Met Leu Arg His Cys Pro Ser Lys Arg Lys
encoded by the cDNA sequence (SEQ ID NO: 31) of: 

1 ACCAGGCTCA TCAAAGCTGC TCCAGGAAGG CCCAAGCCAG ACCAGAAGAC 
51 ATGCAGATCA TCACCACAGC CCTGGTGTGC TTGCTGCTAG CTGGGATGTG

101 GCCGGAAGAT GTGGACAGCA AGAGCATGCA GGTACCCTTC TCCAGATGTT 
151 GCTTCTCATT TGCGGAGCAA GAGATTCCCC TGAGGGCAAT CCTGTGTTAC 
201 AGAAATACCA GCTCCATCTG CTCCAATGAG GGCTTAATAT TCAAGCTGAA 
251 GAGAGGCAAA GAGGCCTGCG CCTTGGACAC AGTTGGATGG GTTCAGAGGC 
301 ACAGAAAAAT GCTGAGGCAC TGCCCGTCAA AAAGAAAATG AGCAGATTTC
351 TTTCCATTGT GGGCTCTGGA AACCACATGG CTTCACCTGT CCCCGAAACT 



i i A( AATCATC AACCCCCAAC 
which corresponds to the cDNA sequence and derived amino acid sequence of human secreted
protein (I-309). See Miller et al., J. Immunol. 143, 2907-2916 (1989).
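
The correspondence between a listed cDNA and its derived amino acid sequence can be checked by translating the open reading frame. The following is a minimal sketch, assuming the standard genetic code and assuming that the coding region of SEQ ID NO: 31 begins at the ATG at nucleotide 51. The function name `translate` and the variable names are illustrative only, and only nucleotides 1-100 of the listing are used.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna, start):
    """Translate dna from 0-based index start until a stop codon or the end."""
    protein = []
    for i in range(start, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Nucleotides 1-100 of SEQ ID NO: 31 as listed above (spaces removed).
seq31_head = (
    "ACCAGGCTCATCAAAGCTGCTCCAGGAAGGCCCAAGCCAGACCAGAAGAC"
    "ATGCAGATCATCACCACAGCCCTGGTGTGCTTGCTGCTAGCTGGGATGTG"
)

# The listing numbers nucleotides from 1, so nucleotide 51 is index 50.
peptide = translate(seq31_head, 50)
print(peptide)
```

In one-letter code the result corresponds to Met Gln Ile Ile Thr Thr Ala Leu Val Cys Leu Leu Leu Ala Gly Met, the first sixteen residues of SEQ ID NO: 30.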

In another aspect, the present invention provides a heparanase having the amino acid
sequence (SEQ ID NO: 32) of:

Met Lys Val Ser Ala Ala Leu Leu Cys Leu Leu Leu Ile Ala Ala Thr Phe Ile Pro Gln
Gly Lys Ala Gln Pro Asp Ala Ile Asn Ala Pro Val Thr Cys Cys Tyr Asn Phe Thr Asn
Arg Lys Ile Ser Val Gln Arg Leu Ala Ser Tyr Arg Arg Ile Thr Ser Ser Lys Cys Pro Lys

Glu Ala Val Ile Phe Lys Thr Ile Val Ala Lys Glu Ile Cys Ala Asp Pro Lys Gln Lys Trp

Val Gln Asp Ser Met Asp His Leu Asp Lys Gln Thr Gln Thr Pro Lys Thr
encoded by the cDNA sequence (SEQ ID NO: 33) of:

1 CTAACCCAGA AACATCCAAT TCTCAAACTG AAGCTCGCAC TCTCGCCTCC 
51 AGCATGAAAG TCTCTGCCGC CCTTCTGTGC CTGCTGCTCA TAGCAGCCAC

101 CTTCATTCCC CAAGGGCTCG CTCAGCCAGA TGCAATCAAT GCCCCAGTCA 
151 CCTGCTGTTA TAACTTCACC AATAGGAAGA TCTCAGTGCA GAGGCTCGCG 
201 AGCTATAGAA GAATCACCAG CAGCAAGTGT CCCAAAGAAG CTGTGATCTT 
251 CAAGACCATT GTGGCCAAGG AGATCTGTGC TGACCCCAAG CAGAAGTGGG 
301 TTCAGGATTC CATGGACCAC CTGGACAAGC AAACCCAAAC TCCGAAGACT

351 TGAACACTCA CTCCACAACC CAAGAATCTG CAGCTAACTT ATTTTCCCCT 
401 AGCTTTCCCC AGACACCCTG TTTTATTTTA TTATAATGAA TTTTGTTTGT 
451 TGATGTGAAA CATTATGCCT TAAGTAATGT TAATTCTTAT TTAAGTTATT
501 GATGTTTTAA GTTTATCTTT CATGGTACTA GTGTTTTTTA GATACAGAGA 
551 CTTGGGGAAA TTGCTTTTCC TCTTGAACCA CAGTTCTACC CCTGGGATGT
601 TTTGAGGGTC TTTGCAAGAA TCATTAATAC AAAGAATTTT TTTTAACATT 
651 CCAATGCATT GCTAAAATAT TATTGTGGAA ATGAATATTT TGTAACTATT 
701 ACACCAAATA AATATATTTT TGTAC 
which corresponds to the cDNA sequence and derived amino acid sequence of monocyte
chemoattractant protein 1 (MCP-1). See Yoshimura et al., FEBS Lett. 244, 487-493 (1989).

In another aspect, the present invention provides a heparanase having the amino acid 
sequence (SEQ ID NO: 34) of: 

Met Lys Ala Ser Ala Ala Leu Leu Cys Leu Leu Leu Thr Ala Ala Ala Phe Ser Pro Gln
Gly Leu Ala Gln Pro Val Gly Ile Asn Thr Ser Thr Thr Cys Cys Tyr Arg Phe Ile Asn
Lys Lys Ile Pro Lys Gln Arg Leu Glu Ser Tyr Arg Arg Thr Thr Ser Ser His Cys Pro
Arg Glu Ala Val Ile Phe Lys Thr Lys Leu Asp Lys Glu Ile Cys Ala Asp Pro Thr Gln



AGCAGAGGGG CTGAGACCAA ACCAUAAACC TCCAATTCTC ATGTGGAAGC 
51 CCATGCCCTC ACCCTCCAAC ATGAAAGCCT CTGCAGCACT TCTGTGTCTG
101 CTGCTCACAG CAGCTGCTTT CAGCCCCCAG GGGCTTGCTC AGCCAGTTGG
151 GATTAATACT TCAACTACCT GCTGCTACAG ATTTATCAAT AAGAAAATCC
201 CTAAGCAGAG GCTGGAGAGC TACAGAAGGA CCACCAGTAG CCACTGTCCC 
251 CGGGAAGCTG TAATCTTCAA GACCAAACTG GACAAGGAGA TCTGTGCTGA 
301 CCCCACACAG AAGTGGGTCC AGGACTTTAT GAAGCACCTG GACAAGAAAA 
351 CCCAAACTCC AAAGCTTTGA ACATTCATGA CTGAACTAAA AACAAGCCAT
401 GACTTGAGA A apa a ata att Tr.TATArrcT CTCCTTTCTC AOnuiuui it 
451 TGAGATTATT TTAATCTAAT TCTAAGGAAT ATGAGCTTTA TGTAATAATG 
501 TGAATCATGG TTTTTCTTAG TAGATTTTAA AAGTTATTAA TATTTTAATT 
551 TAATCTTCCA TGGATTTTGG TGGGTTTTGA ACATAAAGCC TTGGATGTAT 
601 ATGTCATCTC AGTGCTGTAA AAACTGTGGG ATGCTrfTCC CTTCTCTACC
651 TCATGGGGGT ATTGTATAAG TCCTTGCAAG AATCAGTGCA AAGATTTGCT 
701 TTAATTGTTA AGATATGATG TCCCTATGGA AGCATATTGT TATTATATAA 
751 TTACATATTT GCATATGTAT GACTCCCAAA TTTTCACATA AAATAGATTT 
801 TTGTAAAAAA 

which corresponds to the cDNA sequence and derived amino acid sequence of monocyte
chemoattractant protein 3 (MCP-3). See: Structural and Functional Identification of Two
Human, Tumor-derived Monocyte Chemotactic Proteins (MCP-2 and MCP-3) Belonging to the
Chemokine Family. Jo Van Damme, Paul Proost, Jean-Pierre Lenaerts, and Ghislain
Opdenakker. J. Exp. Med. 176: 59-65, 1992.

In another aspect, the present invention provides a heparanase having the amino acid
sequence (SEQ ID NO: 36) of:

Met Lys Val Ser Ala Ala Arg Leu Ala Val Ile Leu Ile Ala Thr Ala Leu Cys Ala Pro
Ala Ser Ala Ser Pro Tyr Ser Ser Asp Thr Thr Pro Cys Cys Phe Ala Tyr Ile Ala Arg Pro
Leu Pro Arg Ala His Ile Lys Glu Tyr Phe Tyr Thr Ser Gly Lys Cys Ser Asn Pro Ala
Val Val Phe Val Thr Arg Lys Asn Arg Gln Val Cys Ala Asn Pro Glu Lys Lys Trp Val
Arg Glu Tyr Ile Asn Ser Leu Glu Met Ser
encoded by the cDNA sequence (SEQ ID NO: 37) of: 

1 CCTCCGACAG CCTCTCCACA GGTACCATGA AGGTCTCCGC GGCACGCCTC 
51 GCTGTCATCC TCATTGCTAC TGCCCTCTGC GCTCCTGCAT CTGCCTCCCC 
101 ATATTCCTCG GACACCACAC CCTGCTGCTT TGCCTACATT GCCCGCCCAC
151 TGCCCCGTGC CCACATCAAG GAGTATTTCT ACACCAGTGG CAAGTGCTCC 

1 i AGGATGGAGA GTCCTTGAAC CTGAACTIAC ACAAATITGC CTGTTTCTGC 
351 TTGCTCTTGT CCTAGCTTGG GAGGCTTCCC CTCACTATCC TACCCCACCC

401 GCTCCTTGAA GGGCCCAGAT TCTGACCACG ACGAGCAGCA OTTACAA \ \ ' 
451 CCTTCCCCAG GCTGGACGTG GTGGCTCAGC CTTGTAATCC CAGCACTTTG
501 GGAGGCCAAG GTGGGTGGAT CACTTGAGGT CAGGAGTTCG AGACAGCCTG 
551 GCCAACATGA TGAAACCCCA TGTGTACTAA AAATACAAAA AATTAGCCGG 
601 GCGTGGTAGC GGGCGCCTGT AGTCCCAGCT ACTCGGGAGG CTGAGGCAGG 
651 AGAATGGCGT GAACCCGGGA GCGGAGCTTG CAGTGAGCCG AGATCGCGCC 



7m ArrnrAPTrr Ary-v^. 



i L,^0 i C i t AAAAAAAAAA 



751 AAAAAAAAAA AAAAAATACA AAAATTAGCC GCGTGGTGGC CCACGCCTGT 
801 AATCCCAGCT ACTCGGGAGG CTAAGGCAGG AAAATTGTTT GAACCCAGGA 
851 GGTGGAGGCT GCAGTGAGCT GAGATTGTGC CACTTCACTC CAGCCTGGGT 
901 GACAAAGTGA GACTCCGTCA CAACAACAAC AACAAAAAGC TTCCCCAACT

951 AAAGCCTAGA AGAGCTTCTG AGGCGCTGCT TTGTCAAAAG GAAGTCTCTA 
1001 GGTTCTGAGC TCTGGCTTTG CCTTGGCTTT GCAAGGGCTC TGTGACAAGG
1051 AAGGAAGTCA GCATGCCTCT AGAGGCAAGG AAGGGAGGAA CACTGCACTC 
1101 TTAAGCTTCC GCCGTCTCAA CCCCTCACAG GAGCTTACTG GCAAACATGA 
1151 AAAATCGGGG

which corresponds to the cDNA sequence and derived amino acid sequence of human T
cell-specific protein (RANTES). See Schall et al., J. Immunol. 141, 1013-1025 (1988).

In another aspect, the present invention provides a heparanase having the amino acid 
sequence (SEQ ID NO: 38) of: 
Met Arg Ile Ser Ala Thr Leu Leu Cys Leu Leu Leu Ile Ala Ala Ala Phe Ser Ile Gln Val
Trp Ala Gln Pro Asp Gly Pro Asn Ala Ser Thr Cys Cys Tyr Val Lys Lys Gln Lys Ile
Pro Lys Arg Asn Leu Lys Ser Tyr Arg Arg Ile Thr Ser Ser Arg Cys Pro Trp Glu Ala
Val Ile Phe Lys Thr Lys Lys Gly Met Glu Val Cys Arg Glu Ala His Gln Lys Trp Val
Glu Glu Ala Ile Ala Tyr Leu Asp Met Lys Thr Pro Thr Pro Lys Pro
encoded by the cDNA sequence (SEQ ID NO: 39) of:

1 ACTGAAGCCA GCTCTCTCAC TCTCTTTCTC CACCATGAGG ATCTCTGCCA 
51 CGCTTCTGTG CCTGCTGCTC ATAGCCGCTG CTTTCAGCAT CCAAGTGTGG 
101 GCCCAACCAG ATGGGCCCAA TGCATCCACA TGCTGCTATG TCAAGAAACA 
151 AAAGATCCCC AAGAGGAATC TCAAGAGCTA CAGAAGGATC ACCAGTAGTC 
201 GGTGTCCCTG GGAAGCTGTT ATCTTCAAGA CAAAGAAGGG CATGGAAGTC 
251 TGTCGTGAAG CCCATCAGAA GTGGGTCGAG GAGGCTATAG CATACTTAGA 



- . ACTG1TGATG AAA 1'UTGTTG ATCACGGTCC TAAGGGATAG GAGCTGTCTG 
451 TAGGAATGTG AAACAGTCAC GCCTAAGGAA TGGTCTTTAA GTTATTAATA 
501 TTTTTATTTA ATTAGCCATG TACTTTGGTG TGATTTGAAT GTAAAGCTTT
551 GGAGACCTCA TGTCACTTTA ACATTGTGTT AGCTGCAGAA TTC
which corresponds to the cDNA sequence and derived amino acid sequence of human fic (growth
factor-activated gene). See Heinrich et al., Molecular and Cellular Biology 13: 2020-2030,
1993.

In another aspect, the present invention provides a heparanase having the amino acid
sequence (SEQ ID NO: 40) of:

Asp Ser Val Ser Ile Phe Ile Thr Cys Cys Phe Asn Val Ile Asn Arg Lys Ile Pro Ile Gln
Arg Leu Glu Ser Tyr Thr Arg Ile Thr Asn Ile Gln Cys Pro Lys Glu Ala Val Ile Phe Lys
Thr Gly Lys Glu Val Cys Ala Asp Pro Lys Glu Arg Trp Val Arg Asp Ser Met Lys His
Lys Asp Gln Ile Phe Gln Asn Leu Lys Pro
which corresponds to the cDNA sequence and derived amino acid sequence monocyte 
chemoattractant protein 2 (MCP-2). See VanDamme et al., J. Exp. Med. 176: 59 - 65, 1992. 

The purified heparanase of the present invention allows for the convenient selection of
compounds having anti-heparanase activity (AHA compounds), i.e. inhibitors of heparanase
activity (IHA), by measuring inhibition of heparanase activity. Inhibition of heparanase activity
can be measured utilizing in vivo radiolabeled heparan sulfate/heparin. This ligand is
radiolabeled to high specific activity by intraperitoneal injection of 0.5 mCi of S-35 sulfate into
C57 mice bearing a 1-2 cm basement membrane tumor (EHS; Engelbreth-Holm-Swarm tumor).
The tumor is harvested after 16 hours and the heparan sulfate proteoglycan is extracted in 4
volumes of 6 M urea, 20 mM Tris pH 6.8, protease inhibitors, 0.15 M NaCl and 0.5% Triton
X-100. The urea extract is chromatographed on an anion exchange column and the proteoglycan is
eluted in a linear gradient of NaCl. The radiolabeled proteoglycan is exchanged into a solution
of 4.0 M guanidine-HCl, 20 mM Tris pH 7.4 and applied to a size exclusion column. The
proteoglycan peak is pooled and exchanged into 0.15 mM NaCl and 20 mM Tris pH 7.4.

Purified, radiolabeled proteoglycan is coupled to commercially available agarose support. 
A quantitative assay of heparanase activity is constructed with the radiolabeled ligand in a multi- 
well format. Briefly, known quantities of recombinant heparanase are added to a multi-well 
plate containing equal amounts of radiolabeled ligand in each well. Enzyme-ligand interaction 
proceeds overnight and the ligand-agarose complex is recovered by centrifugation. Radioactivity 
in the liquid phase is determined by scintillation counting and is the measure of enzyme activity. 
Potential enzyme inhibitors can be evaluated by adding the compound to the solution phase or 



In addition, the purified heparanase of the present invention can be used for therapeutic
wound healing or can be immobilized onto filters and used to degrade heparin from the blood of
patients post-surgery.

Wound treatment can be achieved by administering to an afflicted individual an
effective amount of a pharmaceutical composition comprising the purified heparanase in
combination with a pharmaceutically acceptable, preferably slow releasing, carrier. See, e.g.,
PCT/US90/04772, incorporated herein by reference.

Immobilization onto filters can be achieved by the methods well known in the art

including those disclosed by T

Applications, eds. Cooper et al., pp. 493-509, 1982, and those described in U.S. Patent Nos.
4,373,023, 4,863,611 and 5,211,850 (all incorporated herein by reference).

The purified heparanase of the subject invention can be prepared by the method
described in procedure A or procedure B, but preferably procedure A.

PROCEDURE A 

Reverse transcription of the mRNA from activated human leukocyte-derived cells
[preferably lymphocytes, neutrophils, platelets, Jurkat lymphoma cells, Dami cells (Greenberg et
al., Blood 72: 1968-1977, 1988)] is used to prepare the cDNA for the desired heparanase
enzyme (preferably SEQ. ID. NO: 1; optionally SEQ. ID. NO: 3, SEQ. ID. NO: 5, SEQ. ID.
NO: 7, SEQ. ID. NO: 13, SEQ. ID. NO: 15, SEQ. ID. NO: 17, SEQ. ID. NO: 19, SEQ. ID.
NO: 21, SEQ. ID. NO: 23, SEQ. ID. NO: 25, SEQ. ID. NO: 27, SEQ. ID. NO: 29, SEQ. ID.
NO: 31, SEQ. ID. NO: 33 or SEQ. ID. NO: 35), employing standard PCR cloning techniques
(described in Sambrook et al., in: Molecular Cloning, A Laboratory Manual, Second Edition,
1989, Cold Spring Harbor Press). The cDNA encoding the heparanase enzyme is cloned into the
XbaI/BamHI sites in the commercially available baculovirus vector pVL1392 (Pharmingen;
San Diego, CA). High titer infectious virus is selected for use in infecting Sf9 insect cells
(Luckow and Summers, Bio/Technology 6, 47, 1988). Serum-free medium conditioned by
infected Sf9 cells is collected after 72 hours. This medium is the starting material for purification
of recombinant heparanase. Serum-free conditioned medium is adjusted to contain 20 mM sodium
acetate, pH 5.0, 0.15 M NaCl, 1 mM reduced glutathione (GSH), 1 mM dithiothreitol (DTT) and
10 mM β-octylglucoside. The medium is applied to a column of cation-exchange resin
(Pharmacia) and eluted from the column in a linear gradient of NaCl. Fractions containing
heparanase are pooled and diluted to a final salt concentration of 0.15 M NaCl. To this solution
is added 20 mM Tris and the pH is adjusted to 7.0. The solution is applied to a column of
heparin-Sepharose (Pharmacia) and eluted with a linear salt gradient buffered to pH 5.0 with



heparanase activity per µg protein; incubation in the presence of transglutaminase, under
reducing conditions, in accordance with the procedure in Example 2, Part C.

PROCEDURE B

This procedure describes the purification to homogeneity of heparanase (SEQ. ID. NO:
1) from human blood cells or cell lines (such as platelets) under reducing conditions which
allow for the occurrence of post-translational modifications that increase the specific activity of
heparanase and make it suitable for use in the above described screening assay. The cells are

allows for the release of enzymes and cytokines from the cell. Reducing agents are added to the
supernatant from the activated cells. Suitable reducing agents include dithiothreitol (DTT),
dithioerythritol (DTE), reduced glutathione (GSH), and β-mercaptoethanol. The reduced,
activated supernatant is chromatographed on a column of immobilized heparin or heparan sulfate
under reducing conditions at pH 5, using a salt gradient (such as NaCl, KCl, or other salt) to
elute the bound proteins. Fractions containing heparanase activity are pooled and exchanged
into any buffer appropriate for the pH of 6.8 and containing 0.15 M NaCl, reducing agents, and
non-ionic detergent. This is passed over any suitable anion-exchange column (bed volume of 5
ml or less). The unbound material from this column is adjusted to pH 5 with acid, and is
loaded onto any suitable cation-exchange column (bed volume of 5 ml or less), equilibrated in a
suitable pH 5 buffer containing 0.15 M NaCl, reducing agents, and non-ionic detergents. The
bound protein is eluted from the column with a salt gradient, and the fractions containing
heparanase activity are pooled and size fractionated to below 30,000 daltons with 30 K-cut-off
membranes. The protein below 30,000 daltons is concentrated by either heparin-sepharose
chromatography or by centrifugation through 5 K-cut-off membranes.

The present invention is seen more fully by the examples set forth below.

Example 1: Use of Heparanase as a screen for AHA compounds.

1. Heparan sulfate, metabolically labeled (S-35) to a high specific activity as described
above for the EHS tumor and prepared by papain digestion of chromatographically purified heparan
sulfate proteoglycan, is coupled to cyanogen bromide activated Sepharose-6B (Pharmacia)
according to the manufacturer's instructions.

2. 35S-Heparan sulfate-Sepharose 6B is resuspended in: 0.15 M NaCl, 0.03% human
serum albumin, 10 µM MgCl2, 10 µM CaCl2, antiproteolytic agents (1 µg/ml leupeptin, 2 µg/ml
antipain, 10 µg/ml benzamidine, 10 units/ml aprotinin, 1 µg/ml chymostatin, and 1 µg/ml
pepstatin), and 0.05 M Na acetate, pH 5.6 and 5,000 cpm, in a total volume of 200 µl, are



Separation of digested product is accomplished by centrifugation of the 96 well
plate. The supernatant, containing cleaved heparan sulfate, is decanted and quantitated by
scintillation counting.

4. Inhibitors of heparanase activity can be introduced into the liquid phase of the
assay.

5. A potential inhibitor of heparanase activity would be identified by its ability to
reduce the amount of radiolabeled heparan sulfate released into the supernatant by 50% at a
concentration of 1 µM or less.
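
The selection criterion in step 5 can be expressed as a short calculation on the scintillation counts. The following is a minimal sketch, not a prescribed procedure: the function names, the use of a no-enzyme blank well to correct for background release, and the example counts are all assumptions introduced for illustration.

```python
def percent_inhibition(cpm_inhibited, cpm_uninhibited, cpm_blank):
    """Percent reduction in enzyme-dependent release of labeled heparan sulfate.

    cpm_blank is the supernatant count of a well without enzyme (an assumed
    control, used here to subtract background release).
    """
    released_control = cpm_uninhibited - cpm_blank
    if released_control <= 0:
        raise ValueError("control well shows no enzyme-dependent release")
    released_test = cpm_inhibited - cpm_blank
    return 100.0 * (1.0 - released_test / released_control)


def is_candidate_inhibitor(concentration_um, inhibition_pct):
    """Apply the screening criterion: at least 50% inhibition at 1 uM or less."""
    return concentration_um <= 1.0 and inhibition_pct >= 50.0


# Hypothetical counts for illustration only (not data from this disclosure).
pct = percent_inhibition(cpm_inhibited=900, cpm_uninhibited=2100, cpm_blank=100)
print(pct, is_candidate_inhibitor(0.5, pct))
```

Keeping the concentration test separate from the inhibition calculation lets the same helper be reused if a compound is assayed at several concentrations.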

Example 2: The preparation of heparanase as outlined in Procedure B.

Part A: 

Platelet-rich plasma (10^9 platelets/ml; 1800 ml) is obtained from healthy, informed
volunteers by plasmapheresis. The plasma is removed from the platelets by centrifugation
(Heldin et al., Exp. Cell Res. 109: 429-437, 1977). Platelets suspended in phosphate buffered
saline (PBS; 0.1 original volume) are then stimulated with 1 U/ml thrombin for 5 min at 37°C.
This concentration of thrombin was reported to release 100% of the heparanase activity from
platelets (Oldberg et al., Biochemistry 19: 5755-5762, 1980). Following activation, the
thrombin is inactivated by the addition of 100 mM phenylmethylsulfonylfluoride (PMSF), and
the platelets are centrifuged at 2000 x g for 30 min at 4°C. The supernatant is stored at -80°C
until used for the chromatographic purification of heparanase (Part B).

Part B: Chromatographic purification of heparanase.

1. Heparin-Sepharose Chromatography. Activated platelet supernatants are pooled and
adjusted to contain 1 mM GSH and 1 mM DTT. This pool is loaded (2.5 ml/min) onto a
column of heparin-sepharose (2.6 x 7.5 cm, 40 ml) equilibrated in 1 mM GSH, 1 mM DTT, 150
mM NaCl, 10 mM NaPO4, pH 7.4. After loading the sample, the column is washed with 200
ml of 0.15 M NaCl, 1 mM GSH, 1 mM DTT, 10 mM Na acetate, pH 5, followed by 60 ml of
0.35 M NaCl, 1 mM DTT, 1 mM GSH, 10 mM Na acetate, pH 5. The column is then eluted
with a 160 ml linear gradient between 0.35 M NaCl and 1.5 M NaCl in the same buffer.
Aliquots of each fraction are used for determination of heparanase activity by the "Purification
Assay" described later.
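
For orientation, the position of a given salt concentration within the linear gradient of step 1 can be estimated by simple interpolation. The sketch below is illustrative only: the function name is hypothetical, and it assumes an ideal linear gradient with no allowance for column dead volume or mixing. It locates the 0.9 M and 1.15 M NaCl points that bound the fractions carried into the next step.

```python
def gradient_volume(target_molar, start_molar, end_molar, gradient_ml):
    """Volume into a linear gradient at which the eluent reaches target_molar."""
    low, high = min(start_molar, end_molar), max(start_molar, end_molar)
    if not low <= target_molar <= high:
        raise ValueError("target concentration lies outside the gradient")
    return gradient_ml * (target_molar - start_molar) / (end_molar - start_molar)


# Gradient of step 1: 0.35 M to 1.5 M NaCl over 160 ml.
v_low = gradient_volume(0.9, 0.35, 1.5, 160.0)
v_high = gradient_volume(1.15, 0.35, 1.5, 160.0)
print(f"0.9 M at {v_low:.1f} ml, 1.15 M at {v_high:.1f} ml")
```

Under these assumptions the pooled window spans roughly a fifth of the gradient volume; actual fraction positions would be confirmed by conductivity or by the activity assay.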

2. Anion-exchange chromatography (for example, DEAE-Sephacel, Pharmacia). The 0.9
M - 1.15 M NaCl fractions from the heparin-sepharose column are concentrated using a stirred
cell fitted with a PM-10 membrane, and the buffer is exchanged to 0.15 M NaCl, 1 mM DTT, 1
mM GSH, 10 mM β-octylglucoside, 10 mM sodium phosphate, pH 6.8 (8 ml). This sample is

flow-through and wash with equilibration buffer are collected as one pool. The column is then eluted
with 10 ml of 0.15 M NaCl, 10 mM β-octylglucoside, 1 mM GSH, 1 mM DTT, 10 mM Na
acetate, pH 5, followed by 10 ml of 1.5 M NaCl, 10 mM β-octylglucoside, 1 mM GSH, 1 mM
DTT, 10 mM Na acetate, pH 5. Aliquots of each pool are used for determination of heparanase
activity by the "Purification Assay".

3. Cation Exchange. The unbound sample from the DEAE-Sephacel column is adjusted to
pH 5 with glacial acetic acid and loaded onto a cation exchange column (Poros HS/F, 4.6 mm x
50 mm; PerSeptive Biosystems), pre-equilibrated with 0.15 M NaCl, 1 mM DTT, 1 mM GSH,

- - , r ~ uiv uuw i6 j,u iiu/min ana J ml 

fractions are collected. After washing the column with 35 ml of equilibration buffer, the column
is developed with a 55 ml linear gradient between 0.15 M and 1.5 M NaCl in the same buffer.
10 µl aliquots of the fractions are used for determination of heparanase activity by the
"Purification Assay".

4. Size fractionation to < 30 kD and concentration on immobilized heparin (Hi-trap
heparin-sepharose, Pharmacia). The activity from the Poros HS/F column is size fractionated by
centrifuging through 30,000 molecular weight cut-off filters (Millipore ultrafree-MC 30,000
NMWL filter units). The < 30 kD pool is diluted to contain 0.15 M NaCl, and is loaded onto a
1 ml Hi-trap heparin column, pre-equilibrated with 0.15 M NaCl, 1 mM DTT, 1 mM GSH, 10
mM Na acetate, pH 5. The column is eluted with 1.2 M NaCl in the same buffer and the single
eluted peak contains the heparanase activity.

Part C: Properties of the purified heparanase.

The final yield of heparanase protein from 1850 ml platelet-rich plasma was 2.7 mg.
Protein concentration was determined by the method of Lowry (J. Biol. Chem. 193: 265-275,
1951), or if more precise determinations were required, by amino acid analysis on an amino acid
analyzer (Beckman 6300). The overall recovery of activity was 8%, with a 4150-fold
purification. The preparation was judged to be homogeneous by the presence of a single band
of 9000 daltons on an 18% silver-stained SDS-polyacrylamide gel, run according to the method
of Laemmli (Nature 227: 680-685, 1970).
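
The yield, recovery and fold-purification figures are linked by the usual definitions, and the link can be made explicit. The sketch below is a consistency check rather than reported data: it assumes that recovery means final total activity divided by starting total activity, that fold purification means the ratio of final to starting specific activity, and that both figures refer to the same starting material. The function name is hypothetical.

```python
def implied_starting_protein_mg(final_protein_mg, fold_purification, recovery_fraction):
    """Starting protein implied by the final yield, fold purification and recovery.

    With A = total activity and P = total protein:
        fold     = (A_final / P_final) / (A_start / P_start)
        recovery = A_final / A_start
    so  P_start  = P_final * fold / recovery.
    """
    if not 0 < recovery_fraction <= 1:
        raise ValueError("recovery must be a fraction between 0 and 1")
    return final_protein_mg * fold_purification / recovery_fraction


# Figures stated above: 2.7 mg final protein, 4150-fold purification, 8% recovery.
start_mg = implied_starting_protein_mg(2.7, 4150.0, 0.08)
print(f"implied starting protein: {start_mg / 1000:.0f} g")
```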

The pH optimum of the purified heparanase was determined by conducting the
"Purification Assay" between pH 3.5 and 8.0, using a citrate buffer (pH 3.5 - 6.0),
citrate-phosphate buffer (pH 6.5 - 7.0), and phosphate buffer (pH 7.5 - 8.0). Heparanase was
active between pH 5.0 and 8.0, with the optimum pH at 5.8.

N-terminal amino acid sequencing of heparanase produced by this procedure was 
performed using a gas/liquid phase Protein Sequencer (Applied Biosystems Inc. Model 4701 



N-terminal amino acid sequences of the heparanase produced in this example were 85%
SEQ. ID. NO: 9 (namely:

Asn Leu Ala Lys Gly Lys Glu Glu Ser Leu Asp Ser Asp Leu Tyr Ala
1               5                   10                  15

Glu Leu Arg),

which is identical to CTAP-III, and 15% SEQ. ID. NO: 10 (namely:

Ser Ser Thr Lys Gly Gln Thr Lys Arg Asn Leu Ala Lys Gly Lys Glu),
1               5                   10                  15

precursor form, platelet basic protein. Interestingly, the N-terminal sequence of
commercial β-thromboglobulin (namely, Calbiochem (Cat. # 605165), Celsus Laboratories (Cat.
# 41705), and Haematologic Technologies (Cat. # HBTG-0210)), which has low levels of
heparanase activity, was 100% SEQ. ID. NO: 11 (namely:

Asn Leu Ala Lys Gly Lys Glu Glu Ser Leu Asp Ser Asp Leu Tyr Ala Glu),
1               5                   10                  15

indicating that the commercial preparation is actually CTAP-III and not β-thromboglobulin.
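
The relationship among the three N-terminal sequences can be checked by a simple residue-by-residue comparison. The following is a minimal sketch using the residues reported for SEQ. ID. NO: 9, 10 and 11, written in standard three-letter code; the helper name `find_offset` is illustrative only.

```python
def find_offset(longer, shorter):
    """Return the first index at which `shorter` aligns with `longer`, or -1.

    The overlap may run past the end of `longer`; only shared positions
    are compared.
    """
    for offset in range(len(longer)):
        overlap = longer[offset:offset + len(shorter)]
        if overlap == shorter[:len(overlap)]:
            return offset
    return -1


# N-terminal residues as reported for SEQ. ID. NO: 9, 10 and 11.
seq9 = ("Asn Leu Ala Lys Gly Lys Glu Glu Ser Leu Asp Ser Asp Leu Tyr Ala "
        "Glu Leu Arg").split()
seq10 = "Ser Ser Thr Lys Gly Gln Thr Lys Arg Asn Leu Ala Lys Gly Lys Glu".split()
seq11 = ("Asn Leu Ala Lys Gly Lys Glu Glu Ser Leu Asp Ser Asp Leu Tyr Ala "
         "Glu").split()

offset = find_offset(seq10, seq9)
print(offset, seq9[:len(seq11)] == seq11)
```

The offset shows that the SEQ. ID. NO: 9 sequence begins nine residues into SEQ. ID. NO: 10, and the second comparison shows that SEQ. ID. NO: 11 matches the first seventeen residues of SEQ. ID. NO: 9.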

Chromatofocusing of the heparanase produced by this procedure results in two peaks of
differing isoelectric points. To perform the chromatofocusing, heparanase is dissolved in 0.025
M imidazole, pH 7.3. The sample is loaded onto a 0.5 x 20 cm column of Polybuffer
Exchanger 94 (Pharmacia), equilibrated with 0.025 M imidazole, pH 7.3. Immediately after
sample loading, Polybuffer 74 (Pharmacia; 1:8, pH 4) is pumped onto the column at 0.5 ml/min.
2 ml fractions are collected, and the pH of each fraction is determined by a narrow range (pH
4-7) pH paper. Aliquots of each fraction are used to determine heparanase activity by the
"Purification Assay." All of the activity is associated with an absorbance (280 nm) peak that
eluted at pH 4.8 to 5.1, representing approximately 10% of total protein, while 90% of the
protein is eluted at pH 7.3 and is inactive. Aliquots of each protein peak are separated from the
ampholytes by C4 reverse phase chromatography. The peak that eluted from the
chromatofocusing column at pH 7.3 has N-terminal sequences for platelet basic protein and the processed
form, CTAP-III. The peak that is eluted from the chromatofocusing column at pH 4.8 - 5.1 also
contains the sequences of platelet basic protein and the processed form, CTAP-III. All of the
platelet basic protein processed forms have pI's that are calculated and reported to be greater
than 7.6. Thus, the heparanase activity resides in the platelet basic protein and/or the processed
form, CTAP-III, that is modified such that the pI is lowered to 4.8 - 5.1.

Heparanase obtained after chromatofocusing exhibited a specific activity of 80 units/µg
protein, using the "Purification Assay." This represents a 1000-fold increase in the specific

heparanase is ADP-ribosylation. ADP-ribosylation (adenosine diphosphate ribosylation) is a
post-translational modification of proteins or DNA in which the ADP-ribose group of NAD
(nicotinamide adenine dinucleotide) is enzymatically transferred to proteins or DNA. Since this
modification adds two negatively charged phosphate groups to a molecule, it would result in a
lower isoelectric point. Activated platelet supernatants were incubated in 1 mM DTT, 2 mM
MgCl2, 100 mM HEPES, pH 7.4, and 0.5 µM [32P]NAD (specific activity ≈ 1000 Ci/mmol).
The labeled proteins were separated by SDS-polyacrylamide gel electrophoresis on an 18% gel,
transferred to PVDF (polyvinylidene difluoride) membrane, and exposed to X-ray film. The
autoradiogram demonstrates the incorporation of [32P] into a protein of 8000 daltons. The
PVDF membrane was immunoblotted with the anti-Peptide C antisera (1:1500 in PBS
containing 5% dry milk, 0.05% Tween-20, 0.15 M NaCl, 20 mM Tris, pH 7.4, 2 hours at room
temperature), followed by incubation with peroxidase-labeled goat anti-chicken IgG (1:500 in
above buffer, 1 hour at room temperature), and reacted with a peroxidase substrate. The
immunoblot revealed that the 8000 dalton protein that was labeled with [32P] was CTAP-III/heparanase.
The addition of 200 µM sodium nitroprusside, a spontaneous releaser of nitric oxide, to the
ADP-ribosylation reaction resulted in 5-fold more incorporation of [32P] label into
CTAP-III/heparanase, suggesting that this modification can be regulated in vivo by nitric oxide.
Finally, in an analogous manner to that of glyceraldehyde-3-phosphate dehydrogenase, another
platelet ADP-ribosylated glycolytic enzyme (Zhang and Snyder, Proc. Natl. Acad. Sci. USA 89:
9382-9385), it was determined that CTAP-III/heparanase has an auto-ADP-ribosylation activity,
since the [32P]-ADP-ribosylation of CTAP-III/heparanase occurs in reactions where the only
protein present is commercial CTAP-III or purified heparanase. Other chemokine family
members tested, which include IP-10, IL-8, gro-α, and MCAF, also have auto-ADP-ribosylation
activity.

It is contemplated that the high specific activity of CTAP-III/heparanase is a
consequence of ADP-ribosylation of the enzyme in the presence of nitric oxide. It is further
contemplated that the action of transglutaminase on the ADP-ribosylated enzyme will lead to a
further increase in the specific activity.

An amino acid composition of the heparanase produced in Example 2 gave the expected
amino acid composition for CTAP-III, and N-terminal sequencing revealed sequences for platelet
basic protein and the processed form, CTAP-III, confirming that the heparanase activity is
contained in this set of processed proteins and is not due to a minor contaminant. The presence
of heparanase activity in three commercial sources of β-thromboglobulin also confirms this
conclusion. In addition, polyclonal antibodies to β-thromboglobulin were found to precipitate

30 - 70% of the hppnrnnn^ i-r^'i" ^ *\ r 

... i. * i *4 l 1 ■. 

Activation of heparanase with transglutaminase (prepared in accordance with
Example 2, Part B) results in a substantial (about 13-fold) increase in the specific activity of the
enzyme. The heparanase (2 µl at 56 nM) obtained by Example 2, Part B is treated with either
transglutaminase from guinea pig liver (4 mU; Sigma) or with Factor XIII (1 µg; Celsus
Laboratories, Inc.), the blood coagulation factor that is activated by treatment with 5 units of
thrombin at 37 degrees for 30 minutes. Heparanase is activated by incubation of either 2 mU
liver transglutaminase or 5 units of activated Factor XIII in the presence of 0.1 M Na acetate
buffer at pH 6.0 containing 1 mM reduced glutathione and 1 mM CaCl2 for 35 minutes at 37



vr- — 

increase in the specific activity of the heparanase. 

The high degree of sequence identity between CTAP-III and Interleukin-8, a CXC
chemokine family member, assures that an essentially identical folding pattern will be shared by
the two proteins. Since the 3-dimensional structure of Interleukin-8 is known (Clore, et al.,
Biochemistry 29: 1689-1696, 1990; Baldwin, et al., J. Biol. Chem. 265: 6851-6853), one can
model the same for CTAP-III. Such a model can serve to direct research into rationally
designed IHA and to help explain the action of transglutaminase in activating the CTAP-III.
Part D: Purification Assay for Heparanase Activity

Heparanase activity from platelets or column fractions is detected by its ability to digest
the > 70 kD 35S-HSPG to produce lower molecular weight products. Each digest contains 10 µl
sample, 35S-HSPG (2000 cpm), 0.15 M NaCl, 0.03% human serum albumin, 10 µM MgCl2, 10
µM CaCl2, antiproteolytic agents (1 µg/ml leupeptin, 2 µg/ml antipain, 10 µg/ml benzamidine,
10 units/ml aprotinin, 1 µg/ml chymostatin, and 1 µg/ml pepstatin), and 0.05 M Na acetate, pH
5.6 in a total volume of 300 µl. Digests are carried out for 3 to 21 h. The presence of lower
molecular weight radiolabeled products is detected by centrifuging the digests, each containing
2000 cpm of 35S-HSPG (> 70 K), through 30,000 molecular weight cut-off filters (Millipore
Ultrafree-MC 30,000 NMWL filter units). 35S-HSPG degradation is evident by the presence of
radioactivity in the filtrate that passed through the 30 K membrane; this heparanase activity is
expressed as the % of total cpm < 30 K for a given digest. Analysis of heparan sulfate
degradation by this method is quick and reproducible. One unit of heparanase activity is
defined as 1% cpm < 30 K per h. For pH optimum determination, the 0.1 M Na acetate buffer
is replaced by 50 mM citrate, citrate-phosphate, or phosphate buffer at varying pH values. For
samples from chromatographic steps performed under reducing conditions (1 mM GSH, 1 mM
DTT), the concentration of a thiol oxidant (diamide) needed for optimum activity is
determined. This concentration (100 µM diamide) is added to all assay tubes when [...]
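The unit definition above (1 unit = 1% of total cpm recovered below 30 K per hour of digestion) can be written out as a short calculation. This is only a sketch: the function name and the example counts are hypothetical, and no background correction is applied because the text does not describe one.

```python
def heparanase_units(filtrate_cpm, total_cpm, hours):
    """Units of heparanase activity for one digest.

    One unit is taken, as in the assay text, to be 1% of the total cpm
    appearing in the < 30 K filtrate per hour of digestion.
    """
    if total_cpm <= 0 or hours <= 0:
        raise ValueError("total_cpm and hours must be positive")
    percent_below_30k = 100.0 * filtrate_cpm / total_cpm
    return percent_below_30k / hours


# Hypothetical reading: 600 of the 2000 cpm substrate passes the filter after 3 h.
print(heparanase_units(filtrate_cpm=600, total_cpm=2000, hours=3))  # 10.0
```

Dividing by the digestion time makes digests of different lengths (3 to 21 h) comparable, on the assumption that release of label is roughly linear over the interval used.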



35S-HSPG (> 70 K) is prepared from mice bearing a basement membrane tumor that
overproduces HSPG (EHS tumor), using modifications of the method of Ledbetter, et al., 1987.
Briefly, the radiolabeled HSPG was prepared by injecting C57BL mice bearing the EHS tumor
(Orkin, et al., 1977) with sodium [35S]sulfate (0.5 mCi/mouse) 18 h before harvesting the tumor.
The HSPG is extracted from the weighed tumor with 6 volumes (w/v) of Buffer A (3.4 M NaCl,
0.1 M 6-aminohexanoic acid, 0.04 M EDTA, 0.008 M N-ethylmaleimide, 0.002 M PMSF, and
0.05 M Tris-HCl, pH 6.8), by homogenization with a Polytron for 30 s, followed by stirring at
4°C for 1 h. Insoluble material is collected by centrifugation (12,000 x g for 10 min), and the
supernatant is discarded. The insoluble residue is reextracted [...] of Buffer A for 30 min with
stirring at 4°C. Insoluble material is again collected by centrifugation, and the supernatant
fraction is discarded. The insoluble material is then suspended in 6 volumes of Buffer B (6 M
urea, 0.1 M 6-aminohexanoic acid, 0.04 M ethylenediaminetetraacetic acid (EDTA), 0.002 M
PMSF, and 0.05 M Tris-HCl, pH 6.8), homogenized with an electric homogenizer (Polytron) for
30 s, and stirred for 2 h at 4°C. The mixture is centrifuged to remove insoluble material, and
the supernatant is retained. The insoluble material is reextracted with 2 volumes of Buffer B.
The mixture is centrifuged, and the supernatant is combined with the previous supernatant.

35S-HSPG is isolated from the Buffer B supernatant by sequential chromatography on
anion exchange and gel filtration columns. The Buffer B supernatant is dialyzed overnight
against 10 volumes of 6 M urea, 0.15 M NaCl, 0.05 M Tris-HCl, pH 6.8, and is adjusted to
contain 0.5% non-ionic detergent (Triton X-100). This supernatant (from 11 g tumor) is
chromatographed on a 30 ml column of anion exchange resin (DEAE-Sephacel) equilibrated
with 6 M urea, 0.15 M NaCl, 0.05% Triton X-100, 0.05 M Tris-HCl, pH 6.8. After loading the
supernatant and washing with the equilibration buffer, the column is developed with a 250 ml
linear gradient between 0.15 M NaCl and 1.15 M NaCl (flow = 2.0 ml/min). Fractions are
sampled for radioactivity, and those containing the 35SO4 label that elutes from the
DEAE-Sephacel between 0.4 M and 0.8 M NaCl are pooled. The proteoglycan is precipitated by the
addition of 4 volumes of 100% EtOH at -20°C overnight. The precipitate is collected by
centrifugation and is solubilized in 1 ml of Buffer C (4 M Gu-HCl, 20 mM Tris-HCl, pH 7.2).
This solubilized pellet is used for chromatography on a calibrated gel filtration column (1.0 x 50
cm column of Superose 6; Pharmacia) equilibrated in Buffer C (flow = 0.5 ml/min). Fractions
are sampled for radioactivity, and those containing the 35SO4 label that elutes with a molecular
weight ≥ 70 kD are pooled. The proteoglycan is precipitated with 100% EtOH as described
above. The pellet is dissolved in 3 ml PBS, and dialyzed against 3 x 100 volumes of PBS.
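As a worked illustration of the DEAE-Sephacel step, the following sketch converts the pooling window (0.4 M to 0.8 M NaCl) into elution volumes and times for the stated 250 ml linear gradient (0.15 M to 1.15 M NaCl at 2.0 ml/min). It assumes an ideal linear gradient and ignores column and tubing dead volume; the function names are hypothetical.

```python
START_M, END_M = 0.15, 1.15   # gradient limits, M NaCl (from the text)
GRADIENT_ML = 250.0           # total gradient volume, ml (from the text)
FLOW_ML_MIN = 2.0             # flow rate, ml/min (from the text)


def nacl_at(volume_ml):
    """NaCl concentration (M) after volume_ml of an ideal linear gradient."""
    fraction = min(max(volume_ml / GRADIENT_ML, 0.0), 1.0)
    return START_M + fraction * (END_M - START_M)


def volume_for(conc_m):
    """Gradient volume (ml) at which the ideal gradient reaches conc_m."""
    return GRADIENT_ML * (conc_m - START_M) / (END_M - START_M)


for conc in (0.4, 0.8):
    vol = volume_for(conc)
    print(f"{conc} M NaCl: {vol:.1f} ml into the gradient, {vol / FLOW_ML_MIN:.2f} min")
```

Under these assumptions the pooled label elutes roughly between 62.5 ml and 162.5 ml of gradient; in practice the fractions are chosen from the measured radioactivity profile, as the text states.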

Example 3: Preparation of cDNA encoding Heparanase.

Media is removed from cultured HEL (HEL 92.1.7; Human erythroleukemia; ATCC No.
TIB 180) cells stimulated with 10 nM phorbol 12-myristate 13-acetate (Sigma Chemical Co., St.
Louis, MO) and the cells are scraped from the dish and pelleted by centrifugation. The pellet is
extracted with 200 µl of TRI reagent (Molecular Research Center Inc., Cincinnati, OH) and the
total cellular RNA is prepared according to the manufacturer's instructions. For first strand
synthesis, the reverse transcriptase reaction is performed with 10 µl of total cellular RNA
in the presence of 4 µl of 5x transcriptase buffer (Bethesda Research Laboratories, Gaithersburg,
MD), [...] Arlington Heights, IL), and 1 µl 10 mM dNTP (BRL). This solution is heated to
95 degrees C for 5 minutes and then placed on ice. To this is added 1 µl RNAsin and 1 µl
reverse transcriptase (M-MLV) (Promega, Madison, WI). This is incubated at 37 degrees for 60
minutes and then placed on ice. The polymerase chain reaction is carried out as follows. To
3 µl of the first strand (above) is added 1 µl of each Primer (see below), 77 µl of water, 10 µl
10x PCR buffer (Perkin Elmer Cetus, Norwalk, CT) and 2 µl each dNTP. This solution is heat
denatured at 95 degrees C and 1 µl Amplitaq DNA polymerase (Perkin Elmer Cetus) is added.
Hybridization temperature begins at 72 degrees and is lowered by one degree per cycle until
reaching 55 degrees. Each hybridization step is followed with a constant elongation temperature
of 72 degrees. Upon completion the solution is left at 0 degrees until storage at -20 degrees.
The products of the PCR reaction are electrophoresed on 3% NuSieve, 1% agarose gels and
bands of expected size are excised and purified by standard procedures.
Primers:

Platelet Basic Protein: TGG ACT AGT ATG TCC TCC ACC AAA GGA CAA ACT AA
CTAP III: TGG ACT AGT ATG AAC TTG GCG AAA GAG GA
β-thromboglobulin: TGG ACT AGT ATG GGC AAA GAG GAA AGT CTA GAC AG
NAP-2: TGG ACT AGT ATG GAA CTC CGC TGC ATG TGT ATA AA
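The stepped-down hybridization temperature described in this example (start at 72 degrees, lower by one degree per cycle until 55 degrees is reached) is what is now commonly called touchdown PCR. The sketch below lists that schedule and tallies the reaction volume. Two points are assumptions rather than statements from the text: that two primers are used per reaction, and that the water volume is the 77 µl stated in Example 4. The text does not say how many cycles are run once 55 degrees is reached, so the sketch stops there.

```python
def touchdown_annealing(start_c=72, end_c=55, step_c=1):
    """Annealing temperature for each cycle, stepping down from start_c to end_c."""
    return list(range(start_c, end_c - 1, -step_c))


schedule = touchdown_annealing()
print(len(schedule), "cycles:", schedule[0], "down to", schedule[-1])

# Reaction components in microliters.
reaction_ul = {
    "first strand cDNA": 3,
    "primers (assumed 2 x 1 ul)": 2,   # assumption: one primer pair per reaction
    "water": 77,                        # value as stated in Example 4
    "10x PCR buffer": 10,
    "dNTPs (4 x 2 ul)": 8,
    "Amplitaq polymerase": 1,
}
print("total volume:", sum(reaction_ul.values()), "ul")
```

With these volumes the 10 µl of 10x buffer is diluted close to 1x in roughly 100 µl, which is consistent with the component list as given.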
Example 4: Preparation of cDNA encoding Heparanase.
Media is removed from cultured leukocyte-derived cells [e.g., lymphocytes, neutrophils,
platelets, Jurkat lymphoma cells, Dami cells (Greenberg et al., Blood 72:1968-1977, 1988)],
stimulated with Concanavalin A or phorbol 12-myristate 13-acetate (Sigma Chemical Co., St.
Louis, MO) and the cells are scraped from the dish and pelleted by centrifugation. The pellet is
extracted with 200 µl of TRI reagent (Molecular Research Center Inc., Cincinnati, OH) and the
total cellular RNA is prepared according to the manufacturer's instructions. For first strand
synthesis, the reverse transcriptase reaction is performed with 10 µl of total cellular RNA
[...] and 1 µl 10 mM dNTP (BRL). This solution is heated to 95 degrees C for 5 minutes and then
placed on ice. To this is added 1 µl RNAsin and 1 µl reverse transcriptase (M-MLV) (Promega,
Madison, WI). This is incubated at 37 degrees for 60 minutes and then placed on ice. The
polymerase chain reaction is carried out as follows. To 3 µl of the first strand (above) is added
1 µl of each Primer (see below), 77 µl of water, 10 µl 10x PCR buffer (Perkin Elmer Cetus,
Norwalk, CT) and 2 µl each dNTP. This solution is heat denatured at 95 degrees C and 1 µl
Amplitaq DNA polymerase (Perkin Elmer Cetus) is added. Hybridization temperature begins at
72 degrees and is lowered by one degree per cycle until reaching 55 degrees. Each
hybridization step is followed with a constant elongation temperature of 72 degrees. Upon
completion the solution is left at 0 degrees until storage at -20 degrees. The products of the
PCR reaction are electrophoresed on 3% NuSieve, 1% agarose gels and bands of expected size
are excised and purified by standard procedures.
Primers:

Platelet Basic Protein: TGG ACT AGT ATG TCC TCC ACC AAA GGA CAA ACT AA
CTAP III: TGG ACT AGT ATG AAC TTG GCG AAA GAG GA
β-thromboglobulin: TGG ACT AGT ATG GGC AAA GAG GAA AGT CTA GAC AG
NAP-2: TGG ACT AGT ATG GAA CTC CGC TGC ATG TGT ATA AA
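Each primer listed above starts with the same twelve nucleotides, TGG ACT AGT ATG, followed by a gene-specific stretch. ACTAGT is the recognition sequence of the restriction enzyme SpeI and ATG is a start codon; whether the tail was designed for cloning and expression in that way is an assumption, since the text does not say. The sketch below checks the shared tail and locates each gene-specific stretch in the first 120 nucleotides of the platelet basic protein coding sequence (SEQ ID NO: 4). Variable names are hypothetical.

```python
# First 120 nt of SEQ ID NO: 4, copied from the sequence listing.
SEQ4_START = (
    "TCCTCCACCA AAGGACAAAC TAAGAGAAAC TTGGCGAAAG GCAAAGAGGA AAGTCTAGAC "
    "AGTGACTTGT ATGCTGAACT CCGCTGCATG TGTATAAAGA CAACCTCTGG AATTCATCCC"
).replace(" ", "")

PRIMERS = {
    "Platelet Basic Protein": "TGG ACT AGT ATG TCC TCC ACC AAA GGA CAA ACT AA",
    "CTAP III": "TGG ACT AGT ATG AAC TTG GCG AAA GAG GA",
    "beta-thromboglobulin": "TGG ACT AGT ATG GGC AAA GAG GAA AGT CTA GAC AG",
    "NAP-2": "TGG ACT AGT ATG GAA CTC CGC TGC ATG TGT ATA AA",
}
TAIL = "TGGACTAGTATG"  # 5' tail shared by every primer as printed

positions = {}
for name, primer in PRIMERS.items():
    seq = primer.replace(" ", "")
    assert seq.startswith(TAIL), name
    gene_specific = seq[len(TAIL):]
    # str.find returns -1 when the stretch is not an exact substring.
    positions[name] = SEQ4_START.find(gene_specific)
    print(f"{name}: offset {positions[name]}")
```

The offsets 0, 39 and 75 correspond to residues 1, 14 and 26 of platelet basic protein, which are the first residues of SEQ ID NOS: 3, 5 and 7. The CTAP III primer as printed is not an exact substring (the template reads AAA GGC AAA GAG GA after GCG, the primer AAA GAG GA), so the check reports -1 for it; the source gives no basis for deciding whether that is a transcription error or a deliberate design.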

All temperatures expressed throughout the subject specification are in degrees Centigrade. 

The cDNA encoding heparanase is preferably cloned into a vector designed for 
expression in eukaryotic cells, rather than into a vector designed for expression in prokaryotic 
cells (e.g. E. coli). Eukaryotic cells are preferred for expression of genes obtained from higher 
eukaryotes because the signals for synthesis, processing, and secretion of these proteins are 
usually recognized, whereas this is often not true for prokaryotic hosts (Ausubel, et al., ed., in
Short Protocols in Molecular Biology, 2nd edition, John Wiley & Sons, publishers, pg. 16-49, 
1992.). Eukaryotic hosts may include, but are not limited to, the following: insect cells, 
African green monkey kidney cells (COS cells), Chinese hamster ovary cells (CHO cells), and 
Murine 3T3 fibroblasts. 

Experiments demonstrating that a synthetic peptide of CTAP-III/NAP-2 or antisera
raised against a synthetic peptide of CTAP-III/NAP-2 inhibit the heparanase activity of
CTAP-III/NAP-2 suggest that the amino acids participating in enzymatic catalysis are contained
in a C-terminal region of the enzyme.

Peptide Synthesis: A C-terminal peptide contained within the sequences known for CTAP-III
(SEQ ID NO: 1), Platelet Basic Protein (SEQ ID NO: 3), β-thromboglobulin (SEQ ID NO: 5)
[...] according to standard procedures. The N-terminal peptide has the following sequence (SEQ ID
NO: 41): Asn Leu Ala Lys Gly Lys Glu Glu Ser Leu Asp Ser Asp Leu Cys, in which the final
Cys residue was added to regions of known sequence (SEQ ID NOS: [...]) for the purpose of
conjugation to a carrier protein. The C-terminal peptide has the following sequence (SEQ ID
NO: 42): Cys Asn Gln Val Glu Val Ile Ala Thr Leu Lys Asp Gly Arg Lys Ile Cys Leu Asp
Pro Asp Ala Pro Arg Ile Lys Lys Ile Val Gln Lys Lys encoded by the cDNA sequence (SEQ
ID NO: 43) of TGCAACCAAG TCGAAGTGAT AGCCACACTG AAGGATGGGA
GGAAAATCTG CCTGGACCCA GATGCTCCCA GAATCAAGAA AATTGTACAG
AAAAAA. These peptides (SEQ ID NOS: 41 and 42) were synthesized by solid phase
peptide synthesis on an Applied Biosystems 430A Peptide Synthesizer. The
9-fluorenylmethyloxycarbonyl (Fmoc) group was used as the Nα-amino protecting group, and
temporary side-chain protecting groups were as follows: Arg (Pmc), Asn (Trt), Asp (OtBu), Gln
(Trt), Glu (OtBu), His (Trt), Lys (Boc), Ser (tBu), Thr (tBu). Each residue was single coupled
using a HBTU/NMP protocol and capped with acetic anhydride before the next synthesis cycle.
After removal of the N-terminal Fmoc group, temporary side-chain protecting groups were
removed and the peptide cleaved from the resin by treatment with 95% TFA/5% scavengers
(ethyl methyl sulfide/anisole/1,2-ethanedithiol, 1:3:1) for two hours at room temperature. The
crude peptides were precipitated from the cleavage solution with cold diethyl ether. The
precipitated peptide was collected on a sintered glass funnel, washed with diethyl ether,
dissolved in dilute acetic acid, and evaporated to dryness under reduced pressure, and the residue
was redissolved and lyophilized from glacial acetic acid. The crude peptides were purified by
preparative reverse phase chromatography on a Phenomenex C-18 column (22.5 x 250 mm)
using a water/acetonitrile gradient, each phase containing 0.1% trifluoroacetic acid (TFA). Clean
fractions, as determined by analytical HPLC, were pooled, the acetonitrile was evaporated under
reduced pressure, and the aqueous solution was lyophilized. The purified peptides were
characterized by time of flight or FAB mass spectroscopy.
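The stated relationship between the C-terminal peptide (SEQ ID NO: 42) and its cDNA (SEQ ID NO: 43) can be checked by translating the nucleotide sequence with the standard genetic code. The sketch below does this in plain Python. The SEQ ID NO: 43 string is the reading that agrees with the corresponding region of SEQ ID NO: 2 in the sequence listing, which is an assumption wherever the printed copy is unclear; the helper names are hypothetical.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys", "Q": "Gln",
    "E": "Glu", "G": "Gly", "H": "His", "I": "Ile", "L": "Leu", "K": "Lys",
    "M": "Met", "F": "Phe", "P": "Pro", "S": "Ser", "T": "Thr", "W": "Trp",
    "Y": "Tyr", "V": "Val", "*": "Ter",
}

SEQ_43 = (
    "TGCAACCAAG TCGAAGTGAT AGCCACACTG AAGGATGGGA GGAAAATCTG "
    "CCTGGACCCA GATGCTCCCA GAATCAAGAA AATTGTACAG AAAAAA"
).replace(" ", "")

SEQ_42 = ("Cys Asn Gln Val Glu Val Ile Ala Thr Leu Lys Asp Gly Arg Lys Ile "
          "Cys Leu Asp Pro Asp Ala Pro Arg Ile Lys Lys Ile Val Gln Lys Lys")


def translate(dna):
    """Translate complete codons of dna into space-separated three-letter codes."""
    usable = len(dna) - len(dna) % 3
    codons = (dna[i:i + 3] for i in range(0, usable, 3))
    return " ".join(THREE_LETTER[CODON_TABLE[c]] for c in codons)


print(translate(SEQ_43) == SEQ_42)
```

The 96 nucleotides give 32 codons, matching the 32 residues of the peptide, including the N-terminal Cys that in the native sequence is preceded by His.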

Further, SEQ ID NO: 42 can be produced by recombinant DNA methodology as stated
in Procedure A (page 21).

Antisera Production: The synthetic peptides of CTAP-III/NAP-2 were conjugated to keyhole
limpet hemocyanin utilizing a maleimide-activated carrier protein (Pierce Chemical Co. #77107).
300 µg of conjugated peptides were injected into chickens using Freund's complete adjuvant.
The antisera were collected 5 weeks after initial immunization. Specific recognition by the
antisera of commercial CTAP-III (2.5 µg; Celsus Laboratories Inc., Cincinnati, Ohio; Cat. #
41705), isolated heparanase (1.5 µg), and 10 µl of the platelet supernatant used for purification
[...] incubation with a peroxidase labeled goat anti-chicken IgG (1:500; Kirkegaard and Perry) in
the presence of PBS containing 5% dry milk and 0.05% Tween-20. The pre-immune sera did
not recognize 7 - 10 kD proteins in the commercial CTAP-III, isolated heparanase, or platelet
supernatants.

Inhibition of heparanase activity by the C-terminal synthetic peptide (SEQ ID NO: 42) or
antisera: For experiments designed to determine whether the peptide antisera were able to
inhibit heparanase activity, the pre-immune sera and antisera were exchanged into 0.15 M NaCl,
0.01 M sodium phosphate buffer, pH 7.4 (PBS) using a 100 kD cut-off membrane in order to
[...] isolated heparanase (15 ng) were pre-incubated for 30 min with 2 µl of either pre-immune or
anti-CTAP-III antisera before adding the 35S-HSPG to determine heparanase activity. In the
presence of the pre-immune sera, the isolated protein had 14.3 ± 0.1 units of heparanase activity,
while in the presence of the C-terminal peptide antisera, only 0.8 ± 0.7 units of heparanase were
detected (p < 0.001; results confirmed in a second experiment). The N-terminal peptide
antiserum was not able to neutralize the heparanase activity. Similar results were obtained when
the ability of the synthetic peptides to neutralize heparanase activity was examined. Heparanase
assays conducted with 3 nM enzyme, 47 nM 35S-HSPG substrate, and varying concentrations of
peptides showed that heparanase activity was only 5% of control values in the presence of 250
µM C-terminal peptide. By contrast, heparanase activity in the presence of 250 µM of either the
N-terminal peptide or an unrelated peptide (PLALWAR) was 67% of control values. The ability
of either the C-terminal peptide (SEQ ID NO: 42) or antisera from a chicken immunized with the
C-terminal synthetic peptide to neutralize heparanase activity demonstrates conclusively that
CTAP-III and NAP-2 possess heparanase activity, and suggests that the C-terminal region is
essential for catalysis. Modeling of this domain (SEQ ID NO: 42) can be used in the
identification of potent peptide-mimetic compounds capable of inhibiting this enzyme activity.
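The antiserum result above can be restated as percent of control and percent inhibition, which puts it on the same scale as the peptide results (5% and 67% of control). The sketch below uses only the mean values quoted in the text; the function names are hypothetical and the reported standard deviations are not propagated.

```python
def percent_of_control(treated_units, control_units):
    """Activity remaining, as a percentage of the control activity."""
    return 100.0 * treated_units / control_units


def percent_inhibition(treated_units, control_units):
    """Activity lost relative to the control, as a percentage."""
    return 100.0 - percent_of_control(treated_units, control_units)


control = 14.3   # units with pre-immune serum (from the text)
treated = 0.8    # units with C-terminal peptide antiserum (from the text)

print(round(percent_of_control(treated, control), 1))   # 5.6
print(round(percent_inhibition(treated, control), 1))   # 94.4
```

On this scale the C-terminal antiserum leaves about 5.6% of control activity, close to the 5% of control reported for 250 µM C-terminal peptide.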

Computer assisted modeling can be accomplished using programs for automated docking
of molecules within 3D databases, as described in DesJarlais, R.L., Sheridan, R.P., Seibel, G.L.,
Dixon, J.S., Kuntz, I.D., Venkataraghavan, R., "Using shape complementarity as an initial screen
in designing ligands for a receptor binding site of known three-dimensional structure"; J. Med.
Chem. 31:722-729, 1988. Automated de novo construction of ligands that can bind the
catalytic site can also be used, as described in Moon, J.B., Howe, W.J., "Computer design of
bioactive molecules: a method for receptor-based de novo ligand design"; Proteins: Struct.,
Funct., and Genetics, 11:314-328, 1991.
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SEQUENCE LISTING 



(1) GENERAL INFORMATION:

(i) APPLICANT: Hoogewerf, Arlene J.
               Ledbetter, Steven R.

(ii) TITLE OF INVENTION: USE OF HEPARANASE TO IDENTIFY AND ISOLATE
     ANTI-HEPARANASE COMPOUND

(iii) NUMBER OF SEQUENCES: 43

(iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: The Upjohn Company, Intellectual Property Law
(B) STREET: 301 Henrietta
(C) CITY: Kalamazoo
(D) STATE: MI
(E) COUNTRY: USA
(F) ZIP: 49001

(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release #1.0, Version #1.25

(vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER:
(B) FILING DATE:
(C) CLASSIFICATION:

(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: Jameson, William G.
(B) REGISTRATION NUMBER: 27,199
(C) REFERENCE/DOCKET NUMBER: 4731.1 CP

(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: 616/385-7561
(B) TELEFAX: 616/385-6697
(C) TELEX: 224401


(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 85 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

Asn Leu Ala Lys Gly Lys Glu Glu Ser Leu Asp Ser Asp Leu Tyr Ala
1               5                   10                  15

Glu Leu Arg Cys Met Cys Ile Lys Thr Thr Ser Gly Ile His Pro Lys
            20                  25                  30

Asn Ile Gln Ser Leu Glu Val Ile Gly Lys Gly Thr His Cys Asn Gln
        35                  40                  45

Val Glu Val Ile Ala Thr Leu Lys Asp Gly Arg Lys Ile Cys Leu Asp
    50                  55                  60

Pro Asp Ala Pro Arg Ile Lys Lys Ile Val Gln Lys Lys Leu Ala Gly
65                  70                  75                  80

Asp Glu Ser Ala Asp
                85

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 255 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

AACTTGGCGA AAGGCAAAGA GGAAAGTCTA GACAGTGACT TGTATGCTGA ACTCCGCTGC 60
ATGTGTATAA AGACAACCTC TGGAATTCAT CCCAAAAACA TCCAAAGTTT GGAAGTGATC 120
GGGAAAGGAA CCCATTGCAA CCAAGTCGAA GTGATAGCCA CACTGAAGGA TGGGAGGAAA 180
ATCTGCCTGG ACCCAGATGC TCCCAGAATC AAGAAAATTG TACAGAAAAA ATTGGCAGGT 240
GATGAATCTG CTGAT 255

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 94 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

Ser Ser Thr Lys Gly Gln Thr Lys Arg Asn Leu Ala Lys Gly Lys Glu
1               5                   10                  15

Glu Ser Leu Asp Ser Asp Leu Tyr Ala Glu Leu Arg Cys Met Cys Ile
            20                  25                  30

Lys Thr Thr Ser Gly Ile His Pro Lys Asn Ile Gln Ser Leu Glu Val
        35                  40                  45

Ile Gly Lys Gly Thr His Cys Asn Gln Val Glu Val Ile Ala Thr Leu
    50                  55                  60

Lys Asp Gly Arg Lys Ile Cys Leu Asp Pro Asp Ala Pro Arg Ile Lys
65                  70                  75                  80

Lys Ile Val Gln Lys Lys Leu Ala Gly Asp Glu Ser Ala Asp
                85                  90

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 282 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

TCCTCCACCA AAGGACAAAC TAAGAGAAAC TTGGCGAAAG GCAAAGAGGA AAGTCTAGAC 60
AGTGACTTGT ATGCTGAACT CCGCTGCATG TGTATAAAGA CAACCTCTGG AATTCATCCC 120
AAAAACATCC AAAGTTTGGA AGTGATCGGG AAAGGAACCC ATTGCAACCA AGTCGAAGTG 180
ATAGCCACAC TGAAGGATGG GAGGAAAATC TGCCTGGACC CAGATGCTCC CAGAATCAAG 240
AAAATTGTAC AGAAAAAATT GGCAGGTGAT GAATCTGCTG AT 282



(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 81 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

Gly Lys Glu Glu Ser Leu Asp Ser Asp Leu Tyr Ala Glu Leu Arg Cys
1               5                   10                  15

Met Cys Ile Lys Thr Thr Ser Gly Ile His Pro Lys Asn Ile Gln Ser
            20                  25                  30

Leu Glu Val Ile Gly Lys Gly Thr His Cys Asn Gln Val Glu Val Ile
        35                  40                  45

Ala Thr Leu Lys Asp Gly Arg Lys Ile Cys Leu Asp Pro Asp Ala Pro
    50                  55                  60

Arg Ile Lys Lys Ile Val Gln Lys Lys Leu Ala Gly Asp Glu Ser Ala
65                  70                  75                  80

Asp

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 243 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

GGCAAAGAGG AAAGTCTAGA CAGTGACTTG TATGCTGAAC TCCGCTGCAT GTGTATAAAG 60
ACAACCTCTG GAATTCATCC CAAAAACATC CAAAGTTTGG AAGTGATCGG GAAAGGAACC 120
CATTGCAACC AAGTCGAAGT GATAGCCACA CTGAAGGATG GGAGGAAAAT CTGCCTGGAC 180
CCAGATGCTC CCAGAATCAA GAAAATTGTA CAGAAAAAAT TGGCAGGTGA TGAATCTGCT 240
GAT 243



(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 69 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

Glu Leu Arg Cys Met Cys Ile Lys Thr Thr Ser Gly Ile His Pro Lys
1               5                   10                  15

Asn Ile Gln Ser Leu Glu Val Ile Gly Lys Gly Thr His Cys Asn Gln
            20                  25                  30

Val Glu Val Ile Ala Thr Leu Lys Asp Gly Arg Lys Ile Cys Leu Asp
        35                  40                  45

Pro Asp Ala Pro Arg Ile Lys Lys Ile Val Gln Lys Lys Leu Ala Gly
    50                  55                  60

Asp Glu Ser Ala Asp
65

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 207 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

GAACTCCGCT GCATGTGTAT AAAGACAACC TCTGGAATTC ATCCCAAAAA CATCCAAAGT 6 0 

TTGGAAGTGA TCGGGAAAGG AACCCATTGC AACCAAGTCG AAGTGATAGC CACACTGAAG 120 

GATGGGAGGA AAATCTGCCT GGACCCAGAT GCTCCCAGAA TCAAGAAAAT TGTACAGAAA 180 

AAATTGGCAG GTGATGAATC TGCTGAT 207 
(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 19 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

Asn Leu Ala Lys Gly Lys Glu Glu Ser Leu Asp Ser Asp Leu Tyr Ala
1               5                   10                  15

Glu Leu Arg

(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 16 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

Ser Ser Thr Lys Gly Gln Thr Lys Arg Asn Leu Ala Lys Gly Lys Glu
1               5                   10                  15
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(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 17 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

Asn Leu Ala Lys Gly Lys Glu Glu Ser Leu Asp Ser Asp Leu Tyr Ala
1               5                   10                  15

Glu



(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 101 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

Met Ser Ser Ala Ala Gly Phe Cys Ala Ser Arg Pro Gly Leu Leu Phe
1               5                   10                  15

Leu Gly Leu Leu Leu Leu Pro Leu Val Val Ala Phe Ala Ser Ala Glu
            20                  25                  30

Ala Glu Glu Asp Gly Asp Leu Gln Cys Leu Cys Val Lys Thr Thr Ser
        35                  40                  45

Gln Val Arg Pro Arg His Ile Thr Ser Leu Glu Val Ile Lys Ala Gly
    50                  55                  60

Pro His Cys Pro Thr Ala Gln Leu Ile Ala Thr Leu Lys Asn Gly Arg
65                  70                  75                  80

Lys Ile Cys Leu Asp Leu Gln Ala Pro Leu Tyr Lys Lys Ile Ile Lys
                85                  90                  95

Lys Leu Leu Glu Ser
            100

(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 439 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

[...] 60
GTTGCTGCTC CTGCCACTTG TGGTCGCCTT CGCCAGCGCT GAAGCTGAAG AAGATGGGGA 120
CCTGCAGTGC CTGTGTGTGA AGACCACCTC CCAGGTCCGT CCCAGGCACA TCACCAG[...] 180



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GGAGGTGATC AAGGCCGGAC CCCACTGCCC CACTGCCCAA CTGATAGCCA CGCTGAAGAA 24 0 

TGGAAGGAAA ATTTGCTTGG ACCTGCAAGC CCCGCTGTAC AAGAAAATAA TTAAGAAACT 30 0 

TTTGGAGAGT TAGCTACTAG CTGCCTACGT GTGTGCATTT GCTATATAGC ATACTTCTTT 3 60 

TTTCCAGTTT CAATCTAACT GTGAAAGAAA CTTCTGATAT TTGTGTTATC CTTATGATTT 420 

TAAATAAACA AAATAAATC 4 39 



(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 101 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

Met Asn Gln Thr Ala Ile Leu Ile Cys Cys Leu Ile Phe Leu Thr Leu
1               5                   10                  15

Ser Gly Ile Gln Gly Val Pro Leu Ser Arg Thr Val Arg Cys Thr Cys
            20                  25                  30

Ile Ser Ile Ser Asn Gln Pro Val Asn Pro Val Asn Pro Arg Ser Leu
        35                  40                  45

Glu Lys Leu Glu Ile Ile Pro Ala Ser Gln Phe Cys Pro Arg Val Glu
    50                  55                  60

Ile Ile Ala Thr Met Lys Lys Lys Gly Glu Lys Arg Cys Leu Asn Pro
65                  70                  75                  80

Glu Ser Lys Ala Ile Lys Asn Leu Leu Lys Ala Val Ser Lys Glu Met
                85                  90                  95

Ser Lys Arg Ser Pro
            100

(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 650 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

GAGACATTCC TCAATTGCTT AGACATATTC TGAGCCTACA GCAGAGGAAC CTCCAGTCTC 60
AGCACCATGA ATCAAACTGC GATTCTGATT TGCTGCCTTA [...] 120
[...] 180
[...] ATTTTGTCCA 240
CGTGTTGAGA TCATTGCTAC AATGAAAAAG AAGGGTGAGA AGAGATGTCT GAATCCAGAA 300
TCGAAGGCCA TCAAGAATTT ACTGAAAGCA GTTAGCAAGG AAATGTCTAA AAGATCTCCT 360

TAAAACCAGA GGGGAGCAAA ATCGATGCAG TGCTTCCAAG GATGGACCAC ACAGAGGCTG 4 20 

CCTCTCCCAT CACTTCCCTA CATGGAGTAT ATGTCAAGCC ATAATTGTTC TTAGTTTGCA 480 

5 GTTACACTAA AAGGTGACCA ATGATGGTCA CCAAATCAGC TGCTACTACT CCTGTAGGAA 54 0 

GGTTAATGTT CATCATCCTA AGCTATTCAG TAATAACTCT ACCCTGGCAC TATAATGTAA 600 

GCTCTACTGA GGTGCTATGT TCTTAGTGGA TGTTCTGACC CTGCTTCAAA 650 
(2) INFORMATION FOR SEQ ID NO: 16:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 107 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

Met Ala Arg Ala Ala Leu Ser Ala Ala Pro Ser Asn Pro Arg Leu Leu
1               5                   10                  15

Arg Val Ala Leu Leu Leu Leu Leu Leu Val Ala Ala Gly Arg Arg Ala
            20                  25                  30

Ala Gly Ala Ser Val Ala Thr Glu Leu Arg Cys Gln Cys Leu Gln Thr
        35                  40                  45

Leu Gln Gly Ile His Pro Lys Asn Ile Gln Ser Val Asn Val Lys Ser
    50                  55                  60

Pro Gly Pro His Cys Ala Gln Thr Glu Val Ile Ala Thr Leu Lys Asn
65                  70                  75                  80

Gly Arg Lys Ala Cys Leu Asn Pro Ala Ser Pro Ile Val Lys Lys Ile
                85                  90                  95

Ile Glu Lys Met Leu Asn Ser Asp Lys Ser Asn
            100                 105

(2) INFORMATION FOR SEQ ID NO: 17:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1050 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:

CTCGCCAGCT CTTCCGCTCC TCTCACAGCC GCCAGACCCG CCTGCTGAGC CCCATGGCCC 60
GCGCTGCTCT CTCCGCCGCC CCCAGCAATC CCCGGCTCCT GCGAGTGGC[...] 120
[...] 240
AGTCCCCCGG ACCCCACTGC GCCCAAACCG AAGTCATAGC CACACTCAAG AATGGGCGGA 300
AAGCTTGCCT CAATCCTGCA TCCCCCATAG TTAAGAAAAT CATCGAAAAG ATGCTGAACA 360
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GTGACAAATC CAACTGACCA GAAGGGAGGA GGAAGCTCAC TGGTGGCTGT TCCTGAAGGA 4 20 

GGCCCTGCCC TTATAGGAAC AGAAGAGGAA AGAGAGACAC AGCTGCAGAG GCCACCTGGA 4 80 

TTGTGCCTAA TGTGTTTGAG CATCGCTTAG GAGAAGTCTT CTATTTATTT ATTTATTCAT 54 0 

TAGTTTTGAA GATTCTATGT TAATATTTTA GGTGTAAAAT AATTAAGGGT ATGATTAACT 60 0 

CTACCTGCAC ACTGTCCTAT TATATTCATT CTTTTTGAAA TGTCAACCCC AAGTTAGTTC 66 0 

RaTrraATT n*T+m™* ZT~:S.ZZr:.Z nniGiiLiLL AliTUATTATG 720 

TTAATATTTC TGAGGAGCCT GCAACATGCC AGCCACTGTG ATAGAGGCTG GCGGATCCAA 78 0 

GCAAATGGCC AATGAGATCA TTGTGAAGGC AGGGGAATGT ATGTGCACAT CTGTTTTGTA 84 0 

ACTGTTTAGA TGAATGTCAG TTGTTATTTA TTGAAATGAT TTCACAGTGT GTGGTCAACA 90 0 

TTTCTCATGT TGAAACTTTA AGAACTAAAA TGTTCTAAAT ATCCCTTtiGA CATTTTATGT 960 

CTTTCTTGTA AGGCATACTG CCTTGTTTAA TGGTAGTTTT ACAGTGTTTC TGGCTTAGAA 102 0 

CAAAGGGGCT TAATTATTGA TGTTTTCGGA 105 0 
(2) INFORMATION FOR SEQ ID NO: 18:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 102 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:

Met Ala Arg Ala Thr Leu Ser Ala Ala Pro Ser Asn Pro Arg Leu Leu
1               5                   10                  15

Arg Val Ala Leu Leu Leu Leu Leu Leu Val Ala Ala Ser Arg Arg Ala
            20                  25                  30

Ala Gly Ala Pro Lys Ala Thr Glu Lys Arg Cys Gln Cys Lys Gln Thr
        35                  40                  45

Leu Gln Gly Ile His Leu Lys Asn Ile Gln Ser Val Lys Val Lys Ser
    50                  55                  60

Pro Gly Pro His Cys Ala Gln Thr Glu Val Ile Ala Thr Leu Lys Asn
65                  70                  75                  80

Gly Gln Lys Ala Cys Leu Asn Pro Ala Ser Pro Met Val Lys Lys Ile
                85                  90                  95

Ile Glu Lys Met Leu Lys
            100

(2) INFORMATION FOR SEQ ID NO: 19:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1081 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO; 19: 
CTCTCCTCCT CGCACAGCCG CTCGAACCGC CTGCTGAGCC CCATGGCCCG CGCCACGCTC 
5 TCCGCCGCCC CCAGCAATCC CCGGCTCCTG CGGGTGGCGC TGCTGCTCCT GCTCCTGGTG 
GCCGCCAGCC GGCGCGCAGC AGGAGCGCCC CTGGCCACTG AACTGCGCTG CCAGTGCTTG 
q CAGACCCTGC AGGGAATTCA CCTCAAGAAC ATCCAAAGTG TGAAGGTGAA GTCCCCCGGA 24 0 

- - --^^^r-i nj couunohA A*jUTTUTCTC 300 



AACCCCGCAT CGCCCATGGT TAAGAAAATC ATCGAAAAGA TGCTGAAAAA TGGCAAATCC 
AACTGACCAG AAGGAAGGAG GAAGCTTATT GGTGGCTGTT CCTGAAGGAG GCCCTGCCCT 
TACAGGAACA GAAGAGGAAA GAGAGACACA GCTGCAGAGG CCACCTGGAT TGCGCCTAAT 

GTGTTTGAGC ATCACTTAGG Tirnm* fnmm« ___ 

.M^i^ain iiiAiiiAix TATrTGTTTG 

TTTTAGAAGA TTCTATGTTA ATATTTTATG TGTAAAATAA GGTTATGATT GAATCTACTT 
GCACACTCTC CCATTATATT TATTGTTTAT TTTAGGTCM ACCCAAGTTA GTTCAATCCT 
GATTCATATT TAATTTGAAG ATAGAAGGTT TGCAGATATT CTCTAGTCAT TTGTTAATAT 
TTCTTCGTGA TGACATATCA CATGTCAGCC ACTGTGATAG AGGCTGAGGA ATCCAAGAAA 
ATGGCCAGTG AGATCAATGT GACGGCAGGG AMTGTATGT GTGTC1ATTT TGTAACTGTA 
MGATGAATG TCAGTTGTTA TTTATTGAAA TGATTTCACA GTGTGTGGTC AACATTTCTC 
ATGTTGAAGC TTTAAGAACT AAAATGTTCT AAATATCCCT TGGACATTTT ATGTCTTTCT 
TGTAAGGCAT ACTGCCTTGT TTAATGTTAA TTATGCAGTG TTTCCCTCTG TGTTAGAGCA 
GAGAGGTTTC GATATTTATT GATGTTTTCA CAAAGAACAG GAAAATAAAA TATTTAAAAA 
T 

(2) INFORMATION FOR SEQ ID NO: 20:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 107 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:

Met Ala His Ala Thr Leu Ser Ala Ala Pro Ser Asn Pro Arg Leu Leu
1               5                   10                  15

Arg Val Ala Leu Leu Leu Leu Leu Leu Val Ala Ala Ser Arg Arg Ala
            20                  25                  30

Ala Gly Ala Ser Val Val Thr Glu Leu Arg Cys Gln Cys Leu Gln Thr
        35                  40                  45

Leu Gln Gly Ile His Leu Lys Asn Ile Gln Ser Val Asn Val Arg Ser
    50                  55                  60

Pro Gly Pro His Cys Ala Gln Thr Glu Val Ile Ala Thr Leu Lys Asn
65                  70                  75                  80

Gly Lys Lys Ala Cys Leu Asn Pro Ala Ser Pro Met Val Gln Lys Ile
                85                  90                  95

Ile Glu Lys Ile Leu Asn Lys Gly Ser Thr Asn
            100                 105

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 988 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
TTfT Parlor wrnrr i\ r>nr> r»mr»mpnm^»^ r-i - — 

^ - l^^^ uiuiu^xutty L^ttAiijtjtL UAUbCLTKJGC TCTCCGCCGC 60 

CCCCAGCAAT CCCCGGCTCC TGCGGGTGGC GCTGCTGCTC CTGCTCCTGG TGGCCGCCAG 120 

CCGGCGCGCA GCAGGAGCGT CCGTGGTCAC TGAACTGCGC TGCCAGTGCT TGCAGACACT 180 

GCAGGGAATT CACCTCAAGA ACATCCAAAG TGTGAATGTA AGGTCCCCCG GACCCCACTG 240 

CGCCCAAACC GAAGTCATAG CCACACTCAA GAATGGGAAG AAAGCTTGTC TCAACCCCGC 300 

ATCCCCCATG GTTCAGAAAA TCATCGAAAA GATACTGAAC AAGGGGAGCA CCAACTGACA 360 

GGAGAGAAGT AAGAAGCTTA TCAGCGTATC ATTGACACTT CCTGCAGGGT GGTCCCTGCC 420 

CTTACCAGAG CTGAAAATGA AAAAGAGAAC AGCAGCTTTC TAGGGACAGC TGGAAAGGAC 480 

TTAATGTGTT TGACTATTTC TTACGAGGGT TCTACTTATT TATGTATTTA TTTTTGAAAG 540 

CTTGTATTTT AATATTTTAC ATGCTGTTAT TTAAAGATGT GAGTGTGTTT CATCAAACAT 600 

AGCTCAGTCC TGATTATTTA ATTGGAATAT GATGGGTTTT AAATGTGTCA TTAAACTAAT 660 

ATTTAGTGGG AGACCATAAT GTGTCAGCCA CCTTGATAAA TGACAGGGTG GGGAACTGGA 720 

GGGTGGGGGG ATTGAAATGC AAGCAATTAG TGGATCACTG TTAGGGTAAG GGAATGTATG 780 

TACACATCTA TTTTTTATAC TTTTTTTTTA AAAAAAGAAT GTCAGTTGTT ATTTATTCAA 840 

ATTATCTCAC ATTATGTGTT CAACATTTTT ATGCTGAAGT TTCCCTTAGA CATTTTATGT 900 

CTTGCTTGTA GGGCATAATG CCTTGTTTAA TGTCCATTCT GCAGCGTTTC TCTTTCCCTT 960 
GGAAAAGAGA ATTTATCATT ACTGTTAC 988 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 



Met Thr Ser Lys Leu Ala Val Ala Leu Leu Ala Ala Phe Leu Ile Ser 
1 5 10 15 
Ala Ala Leu Cys Glu Gly Ala Val Leu Pro Arg Ser Ala Lys Glu Leu 

20 25 30 

Arg Cys Gln Cys Ile Lys Thr Tyr Ser Lys Pro Phe His Pro Lys Phe 
35 40 45 

Ile Lys Glu Leu Arg Val Ile Glu Ser Gly Pro His Cys Ala Asn Thr 

50 55 60 

Glu Ile Ile Val Lys Leu Ser Asp Gly Arg Glu Leu Cys Leu Asp Pro 

65 70 75 80 

Lys Glu Asn Trp Val Gln Arg Val Val Glu Lys Phe Leu Lys Arg Ala 

85 90 95 

Glu 



(2) INFORMATION FOR SEQ ID NO: 23: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 291 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

ATGACTTCCA AGCTGGCCGT GGCTCTCTTG GCAGCCTTCC TGATTTCTGC AGCTCTGTGT 60 

GAAGGTGCAG TTTTGCCAAG GAGTGCTAAA GAACTTAGAT GTCAGTGCAT AAAGACATAC 120 

TCCAAACCTT TCCACCCCAA ATTTATCAAA GAACTGAGAG TGATTGAGAG TGGACCACAC 180 

TGCGCCAACA CAGAAATTAT TGTAAAGCTT TCTGATGGAA GAGAGCTCTG TCTGGACCCC 240 

AAGGAAAACT GGGTGCAGAG GGTTGTGGAG AAGTTTTTGA AGAGGGCTGA G 291 
(2) INFORMATION FOR SEQ ID NO: 24: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 amino acids 
(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 



Ala Gly Pro Ala Ala Ala Val Leu Arg Glu Lys Arg Cys Val Cys Leu 

1 5 10 15 

Gln Thr Thr Gln Gly Val His Pro Lys Met Ile Ser Asn Leu Gln Val 
20 25 30 

60 

Lys Val Ile Gln Lys Ile Leu Asp Gly Gly Asn Lys Glu Asn 
65 70 75 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 216 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

GTGTTGCGGG AACTGCGGTG CGTGTGTTTA CAGACCACGC AGGGAGTTCA TCCCAAAATG 60 

ATCAGTAATC TGCAAGTGTT CGCCATAGGC CCACAGTGCT CCAAGGTGGA AGTGGTAGCC 120 

TCCCTGAAGA ACGGGAAGGA AATTTGTCTT GATCCAGAAG CCCCTTTTCT AAAGAAAGTC 180 
ATCCACIAAAA TCCTrazran. rr:rrn?inn^ 

2q www v wwt u^iwi unnxwu 216 

(2) INFORMATION FOR SEQ ID NO: 26: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Met Gln Val Ser Thr Ala Ala Leu Ala Val Leu Leu Cys Thr Met Ala 
1 5 10 15 

Leu Cys Asn Gln Val Leu Ser Ala Pro Leu Ala Ala Asp Thr Pro Thr 
20 25 30 

Ala Cys Cys Phe Ser Tyr Thr Ser Arg Gln Ile Pro Gln Asn Phe Ile 
35 40 45 

Ala Asp Tyr Phe Glu Thr Ser Ser Gln Cys Ser Lys Pro Ser Val Ile 

50 55 60 

Phe Leu Thr Lys Arg Gly Arg Gln Val Cys Ala Asp Pro Ser Glu Glu 

65 70 75 80 

Trp Val Gln Lys Tyr Val Ser Asp Leu Glu Leu Ser Ala 

85 90 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4788 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

CTGCTCTGCA GCTCCACTGA AGCACCCCCT 
GACTCTGCCT CAGCTGGGCC TCCACTGCCC 



/■^IlALCr'il A.CATCCAIA I TCACCwCCA 
CTTTCCTCTG AGCCACAATG TCACACCCAG 
ACCCATCTAT AGATGCCTAA A?CCCG"GCA 
GTTATCCAGA CACAACTAAA GTTCCATCCC TTCCATGAAG CCTTCCCCAA CCCTCTGGTG 240 

GAAGGTCACT TCTTCCTCAT GGGGTTCTGA GCTTTCATTT CTTTTTCTAC TAAGAGTTTT 300 

ACAATTACCT GTTCATACAC TCTACCTGCC CCCATGAGAC CAGGGGCATC TCAGAAACAA 360 

AGATCATTAA AACCAACTAA ATCTATTTCT CATTATAAAA TGAGATATGC TGATTGATTG 420 

CAAAATAATA AAATAACAAA GTATGGAAAA GAAAAAAAAA AGCATATAAT CTGGCTGAGA 480 

* """""-GAG AC CCriww.ioii^ ji unrun j. ± nioibl IbAM KAtj/Y/Vl AAULr AAAAAACTGC 540 

TTCAGTTTGG CATTATTTAT GTAAGTATAG TATAGGATCC TTAAAATGGT TCAAAGAAAT 600 

GGGAAATCAA GACTTCATTT TGGCAAAGCC ATTGAACAGA AACTGTAGCA TATTTATCAG 660 

TAATTTCTTT CAGATTAAAC AACTGACAAC AACCCACTTT TCAACCAGTG ATGTTGGAAA 720 

TGTTTTAAAA CAAAATTAGT TCATAAATTT GXGGGTXGAC CAAGAAGGTA ATAAAGTCTC 780 

ACTAAATAAA ATGAGGAAAA TTCAGAAAAA GAAAAAAATA AGAAAATAAA TCACCCATGG 840 

ATCTAAGCAC TATTCATTCT TTAAGGCATG TATTTCCAAG CCTTTTAATT TTTTCATGCC 900 

TAGAGTTGGC ATGGCATATA TATATCTTTA TACAATTCTT CAAATTTTAT AGAATTTGTA 960 

TAATGTTTTA TCTTGCTTTT TTTTTAACCA CTGATGTTAT AAGCATATTT ATGCCACTTC 1020 

ATTCACGTTA GAGACTTAAT AATAAAGGAT CTTGTGGATA ATTTATCATT CCCTGATAGA 1080 

GAAAAATTTA GCTTTGCTTA TTTTAGAGTT ATAAATGATG CTGGGTCAGG TATCTTTATG 1140 

TTTGAAGATG GCTCCATATT TGGGTTGTTT CCACAGAACT CTTTCCAGAA ATGCTTTTTC 1200 

TAGGTTAATG GCTACACATA TTTCTAGGCA CCTGACATAC TGACACCCAC CTCTAAAGTA 1260 

TTTTTATGAT CCACAACTAG CGTTTAACAC AGCGCCCC -G TCACTCCGAG ACTAATAAAT 1320 

AGACAAATGA CTGAAACGTG ACCTCATGCT TTCTATTCC i' CCAGCTTTCA TTGAGTTCCT 1380 

TTCCTCTGGG AGGACTGGGG GTTGTCTAGC CCTCCACAGC ATCAGCCCAT TGACCCTATC 1440 

CTTGTGGTTA TAGCAGCTGA GGAAGCAGAA TTACAGCTCT GTGGGAAGGA ATGGGGCTGG 1500 

AGAGTTCATG CATAGACCAA TTCTTTTTTT TTTTTTTTTT TGAGATGGAG TTTCACTTTT 1560 

GTTGCCCAGG CTGGAGTGCA ATGGCATGAT CTCAGCTCAC CACAGCCCCC ACCTCCTGGG 1620 

TTCAAGCGAT TCTCCTGCCC TCAGCCTCCC GAGTAGCTGG GATTACAGGC ATGTGCCACC 1680 

ACGCCTGACT ACTTTTGTAT TTTTAGTAGA GATGGAGTTT CTCTTTCTTG GTCAGGTTGG 1740 

TCTCAAACTC CTGACCTCAG GTGATCTGCA GCCTCGGCCT CCAAAGTGTT GGGATTACAG 1800 

GTGTGAGCGA CCATGCCTGG CTGCATAGAC CAGTTCTTAT GAGAAGGGAT CAACTAAGAA 1860 
TAGCCTTGGG TTGACACACA CCCCTCTTCA CACTCACAGG AGAAACCCCA TGAAGCTAGA 1920 

ACCAGTCATG AGTTGAGAGC TGAGAGTTAG AGAGTAGCTC AGAGATGCTA TTCTTGGATA 1980 

"^TATAAAGA GGAGAGATGo ^IGCAGACAT CAGAAGGACG CAGGCAGCAA AGAGTAGTCA 2160 

GTCCCTTCTT GGCTCTGCTG ACACTCGAGC CCACATTCCA TCACCTGCTC CCAATCATGC 2220 

AGGTCTCCAC TGCTGCCCTT GCCGTCCTCC TCTGTACCAT GGCTCTCTG" AAC^AGGT~ r 77- r 
TCTCTGCACC ACGTGAGTCC ATGTTGTTGT TGTGGGTATC ACCACTCTCT GGCCATGGTT 2340 

AGACCACATC AGTCTTTTTT TGTGGCGTGA GAGGCCCCGA AGAGAAAAGA AGGAAGTTCT 2400 
TAAAGCGCTG CCAAACACCT TGGTCTTTTT CTTCACAACT TTTATTTTTA TCTCTAGAAG 2460 

GGGTCTTAGC CCTCCTAGTC TCCAGGTATG AGAATCTAGG CAGGGGCAGG GGAGTTACAG 2520 

TCCCTTGTAC AGATAGAAAA ACAGGGTTCA AAACGAATCA GTTTGCAAGA GGCAGAATCC 2580 

+ r ^„„ r „„ T „ — 77~~; ~- ^x^^^u ^Liuttt iHbTCiuccA 2b4 0 

GGAGCCCTGT CCCTTGGATG TCTTATGAGA GATGTCCAGG GCTTCTCTTG GGCTGGGGTA 2700 

TGACTTCTTG AACCGACAAA ATTCCATGAA GAGAGCTAAG AGAACAGTCC ATTCAGGTAT 2760 

CTGGATCACA TAGAGAAACA GAGAACCCAC TATGAAGAGT CAAGGGGAAA GAGGAATATA 2820 

GACAGAAACA AACACACATT TCTCTGCAAA ACCCCCCAAA TGCCTxGCAG TCACTTGGTC 2880 

TGAGCAAGCC TGCCCTCCTC AACCACTCAG GGATCAGAAG CTGCCTGGCC TTTTCTTCTG 2940 

AGCTGTGACT TGGGCTTATT CTCTCCTTTC TCCGCAGTTG CTGCTGACAC GCCGACCGCC 3000 

TGCTGCTTCA GCTACACCTC CCGACAGATT CCACAGAATT TCATAGCTGA CTACTTTGAG 3060 

ACGAGCAGCC AGTGCTCCAA GCCCAGTGTC ATGTAAGTGC CAGTCTTCCT GCTCACCTCT 3120 

AGGGAGGTAG GGAGTGTCAG GGTGGGGGCA GAAACAGGCC AGAAGGCCAT CCTGGAAAGG 3180 

CCCAGCCTTC AGGAGCCTAT CGGGGATACA GGACGCAGGG CACTGAGGTG TGACCTGACT 3240 

TGGGGCTGGA GTGAGGTGGG TGTTACAGAG TCAGGAAGGG CTGCCCCAGG CCAGAGGAAA 3300 

GGGACAGGAA GAAGGAGGCA GCAGGACACT CTGAGGGCCC CCTTGCCTGG AGTCACTGAG 3360 

AGAAGCTCTC TAGACGGAGA TAGGCAGGGG GCCCCTGAGA GAGGAGCAGG CCTTGAGCTG 3420 

CCCAGGACAG AGAGCAGGAT GTCAGGGCCA TGGTGGGCCC AGGATTCCCC GGCTGGATTC 3480 

CCCAGTGCTT AACTCTTCCT CCCTTCTCCA CAGCTTCCTA ACCAAGAGAG GCCGGCAGGT 3540 

CTGTGCTGAC CCCAGTGAGG AGTGGGTCCA GAAATACGTC AGTGACCTGG AGCTGAGTGC 3600 

CTGAGGGGTC CAGAAGCTTC GAGGCCCAGC GACCTCAGTG GGCCCAGTGG GGAGGAGCAG 3660 

GAGCCTGAGC CTTGGGAACA TGCGTGTGAC CTCCACAGCT ACCTCTTCTA TGGACTGGTT 3720 

ATTGCCAAAC AGCCACACTG TGGGACTCTT CTTAACTTAA ATTTTAATTT ATTTATACTA 3780 

TTTAGTTTTT ATAATTTATT TTTGATTTCA CAGTGTGTTT GTGATTGTTT GCTCTGAGAG 3840 

TTCCCCCTGT CCCCTCCACC TTCCCTCACA GTGTGTCTGG TGACAACCGA GTGGCTGTCA 3900 

TCGGCCTGTG TAGGCAGTCA TGGCACCAAA GCCACCAGAC TGACAAATGT GTATCAGATG 3960 

CTTTTGTTCA GGGCTGTGAT CGGCCTGGGG AAATAATAAA GATGTTCTTT TAAACGGTAA 4020 

ACCAGTATTG AGTTTGGTTT TGTTTTTCTG GCAAATCAAA ATCACTGGTT AAGAGGAATC 4080 



■.■:.':'( '.vi"aa : at;:,caccct\: iaggagaaiu aactacttga atggccacca ttaagcagag -wet 

TATTCTGTAG GGCATATTCA TGATGAATCA AGCTCTTAAT AGCAATTATT TACATTGTTG 4320 
AGGCTTACTC CTCCTACTGA GTGCTTTTTA TACATTGTTC ATTTAATCTT A^AAT~ r AA ' 
TAGTACAGCT TAGGTACTAT TAATACCTCC ACTTGACAGA AAAGTAACCC AGGGCTCAGA 4440 

AAGGTTAGAC AACTTGGCTG AGGTTACACA GCACGTAAAC GGTCAATTGT GTTCCAAAAC 4500 

TGGACTTTTA TTGAACTACA GACTATGCTG TTAACCATTG ACCAAGTTAT TTCCCAAAGT 4560 

ATGACCCGCC TATACTCAAA TCTTACCCCA TTCTTTAACA GATGATACTT TATCCATTGC 4620 

AACCACTTCC TGTCAGGATT CTGAGTTGAC ATAGAGTGTT TCAGCAGTGA TTATTTAAGC 4680 

\ tt.\C".t2 ^^x^x^^n^^ ^0^1^ iat ji t t atc aagctcatga 4 740 

GGTGTTCCAT AGCATGTTAA TGACTGAGAG CCACTGTCAA TAGAATTC 4788 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 amino acids 
(B) TYPE: amino acid 
(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 



Met Lys Leu Cys Val Thr Val Leu Ser Leu Leu Met Leu Val Ala Ala 

1 5 10 15 

Phe Cys Ser Pro Ala Leu Ser Ala Pro Met Gly Ser Asp Pro Pro Thr 
20 25 30 

Ala Cys Cys Phe Ser Tyr Thr Ala Arg Lys Leu Pro Arg Asn Phe Val 

35 40 45 

Val Asp Tyr Tyr Glu Thr Ser Ser Leu Cys Ser Gln Pro Ala Val Val 
50 55 60 

Phe Gln Thr Lys Arg Ser Lys Gln Val Cys Ala Asp Pro Ser Glu Ser 
65 70 75 80 

Trp Val Gln Glu Tyr Val Tyr Asp Leu Glu Leu Asn 
85 90 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 696 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
TTCCCCCCCC CCCCCCCCCC CCCCGCCCGA GCACAGGACA CAGCTGGGTT CTGAAGCTTC 60 



"GAATGGGCT CAGACCCTCu GALCGCCTGC XGCTTTXCT: ACACCGCGAG GAAGCTTCCT ^4; 
CGCAACTTTG TGGTAGATTA CTATGAGACC AGCAGCCTCT GCTCCCAGCC AGCTGTGGTA 300 
TTCCAAACCA AAAGAAGCAA GCAAGTCTGT ^CTGAT^^^A ^T-AA^rr^r ^^,7/- A ■■ 
TACGTGTATG ACCTGGAACT GAACTGAGCT GCTCAGAGAC AGGAAGTCTT CAGGGAAGGT 420 
CACCTGAGCC CGGATGCTTC TCCATGAGAC ACATCTCCTC CATACTCAGG ACTCCTCTCC 480 

GCAGTTCCTG TCCCTTCTCT TAATTTAATC TTTTTTATGT GCCGTGTTAT TGTATTAGGT 540 

GTCATTTCCA TTATTTATAT TAGTTTAGCC AAAGGATAAG TGTCCTATGG GGATGGTCCA 600 
CTGTCACTGT TTCTCTGCTG TTGCAAATAC ATGGATAACA CATTTGATTC TGTGTGTTTT 660 

cuataaiaaa aCTTTAAAAT AAAATGCAGA CAGTTA 696 
(2) INFORMATION FOR SEQ ID NO: 30: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 96 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 



Met Gln Ile Ile Thr Thr Ala Leu Val Cys Leu Leu Leu Ala Gly Met 
1 5 10 15 

Trp Pro Glu Asp Val Asp Ser Lys Ser Met Gln Val Pro Phe Ser Arg 
20 25 30 

Cys Cys Phe Ser Phe Ala Glu Gln Glu Ile Pro Leu Arg Ala Ile Leu 
35 40 45 

Cys Tyr Arg Asn Thr Ser Ser Ile Cys Ser Asn Glu Gly Leu Ile Phe 
50 55 60 

Lys Leu Lys Arg Gly Lys Glu Ala Cys Ala Leu Asp Thr Val Gly Trp 
65 70 75 80 

Val Gln Arg His Arg Lys Met Leu Arg His Cys Pro Ser Lys Arg Lys 

85 90 95 



(2) INFORMATION FOR SEQ ID NO: 31: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 520 base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
ACCAGGCTCA TCAAAGCTGC TCCAGGAAGG CCCAAGCCAG ACCAGAAGAC ATGCAGATCA 60 

f\0 T^^rf C r ^ r V^C.'J""- f r r '- r ' rf *T r ',r'T^',r'T?>f~ ^^"^rcf.Tr'^r <• 



.AvAu^AA ; r;;TGTTA- AoAAATACCA ; - ATCCATCT' 7CCAATGA \ ■■ . '.TTTA A^^ ^ £ 
TCAAGCTGAA GAGAGGCAAA GAGGCCTGCG CCTTGGACAC AGTTGGATGG GTTCAGAGGC 300 

ACAGAAAAAT GCTGAGGCAC TGCCCGTCAA AAAGAAAATG AGCAGATTTC TTTCCATTGT 360 
GGGCTCTGGA AACCACATGG CTTCACCTGT CCCCGAAACT ACCAGCCCTA CACCATTCCT 420 
TCTGCCCTGC TTTTGCTAGG TCACAGAGGA TCTGCTTGGT CTTGATAAGC TATGTTGTTG 480 

CACTTTAAAC ATTTAAATTA TACAATCATC AACCCCCAAC 520 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 99 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Met Lys Val Ser Ala Ala Leu Leu Cys Leu Leu Leu Ile Ala Ala Thr 
1 5 10 15 

Phe Ile Pro Gln Gly Lys Ala Gln Pro Asp Ala Ile Asn Ala Pro Val 

20 25 30 

Thr Cys Cys Tyr Asn Phe Thr Asn Arg Lys Ile Ser Val Gln Arg Leu 

35 40 45 

Ala Ser Tyr Arg Arg Ile Thr Ser Ser Lys Cys Pro Lys Glu Ala Val 

50 55 60 

Ile Phe Lys Thr Ile Val Ala Lys Glu Ile Cys Ala Asp Pro Lys Gln 
65 70 75 80 

Lys Trp Val Gln Asp Ser Met Asp His Leu Asp Lys Gln Thr Gln Thr 

85 90 95 

Pro Lys Thr 



(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 725 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
CTAACCCAGA AACATCCAAT TCTCAAACTG AAGCTCGCAC TCTCGCCTCC AGCATGAAAG 60 
TCTCTGCCGC CCTTCTGTGC CTGCTGCTCA TAGCAGCCAC CTTCATTCCC CAAGGGCTCG 120 
CTCAGCCAGA TGCAATCAAT GCCCCAGTCA CCTGCTGTTA TAACTTCACC AATAGGAAGA 180 
60 TCTCAGTGCA GAGGCTCGCG AGCTATAGAA GAATCACCAG CAGCAAGTCT ^rrA^.R^.r- 

:.;v, . ;,:cgaAoA^; igaacactca 

■JTCCACAACC CAAGAATCTG CAGCTAACTT ATTTTCCCCT AGCTTTCCCC AGACACCCTG 420 
TTTTATTTTA TTATAATGAA TTTTGTTTGT TGATGTGAAA CATTATGCCT TAAGTAATGT 480 
TAATTCTTAT TTAAGTTATT GATGTTTTAA GTTTATCTTT CATGGTACTA GTGTTTTTTA 540 

GATACAGAGA CTTGGGGAAA TTGCTTTTCC TCTTGAACCA CAGTTCTACC CCTGGGATGT 600 

TTTGAGGGTC TTTGCAAGAA TCATTAATAC AAAGAATTTT TTTTAACATT CCAATGCATT 660 

GCTAAAATAT TATTGTGGAA ATGAATATTT TGTAACTATT ACACCAAATA AATATATTTT 720 

TGTAC 725 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Met Lys Ala Ser Ala Ala Leu Leu Cys Leu Leu Leu Thr Ala Ala Ala 
1 5 10 15 

Phe Ser Pro Gln Gly Leu Ala Gln Pro Val Gly Ile Asn Thr Ser Thr 

20 25 30 

Thr Cys Cys Tyr Arg Phe Ile Asn Lys Lys Ile Pro Lys Gln Arg Leu 
35 40 45 

Glu Ser Tyr Arg Arg Thr Thr Ser Ser His Cys Pro Arg Glu Ala Val 
50 55 60 

Ile Phe Lys Thr Lys Leu Asp Lys Glu Ile Cys Ala Asp Pro Thr Gln 
65 70 75 80 

Lys Trp Val Gln Asp Phe Met Lys His Leu Asp Lys Lys Thr Gln Thr 

85 90 95 

Pro Lys Leu 



(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 810 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
AGCAGAGGGG CTGAGACCAA ACCAGAAACC TCCAATTCTC ATGTGGAAGC CCATGCCCTC 60 
ACCCTCCAAC ATGAAAGCCT CTGCAGCACT TCTGTGTCTG CTGCTPftrAr: rA.rr^n^T- 



CCACTGTCCC CGGGAAGCTG TAATCTTCAA GACCAAACTG GACAAGGAGA TCTGTGCTGA 300 
CCCCACACAG AAGTGGGTCC AGGACTTTAT GAAGCACCTG GACAAGAAAA CCCAAACTCC 360 

AAAGCTTTGA ACATTCATGA CTGAACTAAA AACAAGCCAT GACTTGAGAA ACAAATAATT 420 

TGTATACCCT GTCCTTTCTC AGAGTGGTTC TGAGATTATT TTAATCTAAT TCTAJ JGAAT 480 

ATGAGCTTTA TGTAATAATG TGAATCATGG TTTTTCTTAG TAGATTTTAA AAGTTATTAA 540 

TATTTTAATT TAATCTTCCA TGGATTTTGG TGGGTTTTGA ACATAAAGCC TTGGATGTAT 600 

ATGTCATCTC AGTGCTGTAA AAACTGTGGG ATGCTCCTCC CTTCTCTACC TCATGGGGGT 660 






rw j.o ±e\j.n*\\j iu^nuunAO nniuAUiUL-ft rtftbttlli(jti IihAi I\3lXA AO/ViTVIAj/ViAj /^U 

TCCCTATGGA AGCATATTGT TAT TAT AT AA TTACATATTT GCATATGTAT GACTCCCAAA 780 

TTTTCACATA AAATAGATTT TTGTAAAAAA 810 
(2) INFORMATION FOR SEQ ID NO: 36: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

Met Lys Val Ser Ala Ala Arg Leu Ala Val Ile Leu Ile Ala Thr Ala 
1 5 10 15 

Leu Cys Ala Pro Ala Ser Ala Ser Pro Tyr Ser Ser Asp Thr Thr Pro 

20 25 30 

Cys Cys Phe Ala Tyr Ile Ala Arg Pro Leu Pro Arg Ala His Ile Lys 
35 40 45 

Glu Tyr Phe Tyr Thr Ser Gly Lys Cys Ser Asn Pro Ala Val Val Phe 

50 55 60 

Val Thr Arg Lys Asn Arg Gln Val Cys Ala Asn Pro Glu Lys Lys Trp 
65 70 75 80 

Val Arg Glu Tyr Ile Asn Ser Leu Glu Met Ser 

85 90 

(2) INFORMATION FOR SEQ ID NO: 37: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1160 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 



.. ..^bG.GCi TGCCTACATI GCCCGCCCAl. TGCCCCGTGC CCACATCAAG GAG TAT T TCI ..b.' 
ACACCAGTGG CAAGTGCTCC AACCCAGCAG TCGTCTTTGT CACCCGAAAG AACCGCCAAG 240 

TGTGTGCCAA CCCAGAGAAG AAATGGGTTC GGGAGTACAT CAACTTTTTG GAGATGAGCT 300 
AGGATGGAGA GTCCTTGAAC CTGAACTTAC ACAAATTTGC CTGTTTCTGC TTGCTCTTGT 360 

CCTAGCTTGG GAGGCTTCCC CTCACTATCC TACCCCACCC GCTCCTTGAA GGGCCCAGAT 420 

TCTGACCACG ACGAGCAGCA GTTACAAAAA CCTTCCCCAG GCTGGACGTG GTGGCTCAGC 480 

CTTGTAATCC CAGCACTTTG GGAGGCCAAG GTGGGTGGAT CACTTGAGGT CAGGAGTTCG 540 

AGACAGCCTG GCCAACATGA TGAAACCCCA TGTGTACTAA AAATACAAAA AATTAGCCGG 600 

r , r , r ,r rr , /^"T , R rr> n^/inr'n/<nirtw •»'■•■«-'- — — — — . _ ^ 

r - ..w^-. 1. -l ^uwuauu \«, j, urioov^H^jVj /i\j/\ArLjUCG'I 66 0 

GAACCCGGGA GCGGAGCTTG CAGTGAGCCG AGATCGCGCC ACTGCACTCC AGCCTGGGCG 720 

ACAGAGCGAG ACTCCGTCTC AAAAAAAAAA AAAAAAAAAA AAAAAATACA AAAATTAGCC 780 

GCGTGGTGGC CCACGCCTGT AATCCCAGCT ACTCGGGAGG CTAAGGCAGG AAAATTGTTT 840 

GAACCCAGGA GGTGGAGGCT GCAGTGAGCT GAGATTGTGC CACTTCACTC CAGCCTGGGT 900 

GACAAAGTGA GACTCCGTCA CAACAACAAC AACAAAAAGC TTCCCCAACT AAAGCCTAGA 960 

AGAGCTTCTG AGGCGCTGCT TTGTCAAAAG GAAGTCTCTA GGTTCTGAGC TCTGGCTTTG 1020 

CCTTGGCTTT GCAAGGGCTC TGTGACAAGG AAGGAAGTCA GCATGCCTCT AGAGGCAAGG 1080 

AAGGGAGGAA CACTGCACTC TTAAGCTTCC GCCGTCTCAA CCCCTCACAG GAGCTTACTG 1140 

GCAAACATGA AAAATCGGGG 1160 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

Met Arg Ile Ser Ala Thr Leu Leu Cys Leu Leu Leu Ile Ala Ala Ala 
1 5 10 15 

Phe Ser Ile Gln Val Trp Ala Gln Pro Asp Gly Pro Asn Ala Ser Thr 
20 25 30 

Cys Cys Tyr Val Lys Lys Gln Lys Ile Pro Lys Arg Asn Leu Lys Ser 
35 40 45 

Tyr Arg Arg Ile Thr Ser Ser Arg Cys Pro Trp Glu Ala Val Ile Phe 

50 55 60 

Lys Thr Lys Lys Gly Met Glu Val Cys Arg Glu Ala His Gln Lys Trp 
65 70 75 80 

Val Glu Glu Ala Ile Ala Tyr Leu Asp Met Lys Thr Pro Thr Pro Lys 
85 90 95 

Pro 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 593 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

ACTGAAGCCA GCTCTCTCAC TCTCTTTCTC CACCATGAGG ATCTCTGCCA CGCTTCTGTG 60 

CCTGCTGCTC ATAGCCGCTG CTTTCAGCAT CCAAGTGTGG GCCCAACCAG ATGGGCCCAA 120 

TGCATCCACA TGCTGCTATG TCAAGAAAC: AAGATCCCC AAGAGGAATC TCAAGAGCTA 180 

CAGAAGGATC ACCAGTAGTC GUTGTCCCTG GGAAGCTGTT ATCTTCAAGA CAAAGAAGGG 240 

CATGGAAGTC TCTCGTGAAG CCCATCAGAA GTGGGTCGAG GAGGCTATAG CATACTTAGA 300 

CATGAAAACC CCAACTCCAA AGCCTTGAAG AAATGTGCCT GAACAGAAAC CAACCTAGGA 360 

GCCAAGAAGC AAAAATTCCT CACCGCTGTT CTTTCTGAGA ACTGTTGATG AAATGTGTTG 420 

ATCACGGTCC TAAGGGATAG GAGCTGTCTG TAGGAATGTG AAACAGTCAC GCCTAAGGAA 480 

TGGTCTTTAA GTTATTAATA TTTTTATTTA ATTAGCCATG TACTTTGGTG TGATTTGAAT 540 

GTAAAGCTCT GGAGACCTCA TGTCACTTTA ACATTGTGTT AGCTGCAGAA TTC 593 
(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 72 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

Asp Ser Val Ser Ile Phe Ile Thr Cys Cys Phe Asn Val Ile Asn Arg 
1 5 10 15 

Lys Ile Pro Ile Gln Arg Leu Glu Ser Tyr Thr Arg Ile Thr Asn Ile 

20 25 30 

Gln Cys Pro Lys Glu Ala Val Ile Phe Lys Thr Gly Lys Glu Val Cys 
35 40 45 

Ala Asp Pro Lys Glu Arg Trp Val Arg Asp Ser Met Lys His Lys Asp 

60 

Gln Ile Phe Gln Asn Leu Lys Pro 



(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

Asn Leu Ala Lys Gly Lys Glu Glu Ser Leu Asp Ser Asp Leu Cys 
1 5 10 15 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

Cys Asn Gln Val Glu Val Ile Ala Thr Leu Lys Asp Gly Arg Lys Ile 
1 5 10 15 

Cys Leu Asp Pro Asp Ala Pro Arg Ile Lys Lys Ile Val Gln Lys Lys 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 96 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
TGCAACCAAG TCGAAGTGAT AGCCACACTG AAGGATGGGA GGAAAATCTG CCTGGACCCA 60 
GATGCTCCCA GAATCAAGAA AATTGTACAG AAAAAA 96 
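
The listing does not state how SEQ ID NO: 43 relates to SEQ ID NO: 42. As a consistency check only, the following Python sketch translates the 96 bases of SEQ ID NO: 43 with the standard genetic code and compares the result with the 32 residues of SEQ ID NO: 42. The helper names (`translate`, `to_one_letter`) are hypothetical and are not part of the original disclosure.

```python
# Illustrative consistency check; all helper names are hypothetical.

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace ignored) in reading frame 1."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any incomplete trailing codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def to_one_letter(peptide: str) -> str:
    """Convert a space-separated three-letter peptide to one-letter codes."""
    return "".join(THREE_TO_ONE[residue] for residue in peptide.split())


# Sequences copied from the listing above.
SEQ_ID_NO_43 = """
TGCAACCAAG TCGAAGTGAT AGCCACACTG AAGGATGGGA GGAAAATCTG CCTGGACCCA
GATGCTCCCA GAATCAAGAA AATTGTACAG AAAAAA
"""
SEQ_ID_NO_42 = """
Cys Asn Gln Val Glu Val Ile Ala Thr Leu Lys Asp Gly Arg Lys Ile
Cys Leu Asp Pro Asp Ala Pro Arg Ile Lys Lys Ile Val Gln Lys Lys
"""

if __name__ == "__main__":
    print(translate(SEQ_ID_NO_43) == to_one_letter(SEQ_ID_NO_42))
```

The same two helpers can be applied to other nucleic acid and peptide pairs in the listing, provided the correct reading frame and start position are supplied, which this sketch does not attempt to infer.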
CLAIMS 

We claim: 

1. A method of screening for AHA compounds comprising the steps of: 

a) contacting a compound with radiolabeled heparin/heparan sulfate and 
heparanase; 

b) maintaining the compound in contact with the heparin/heparan 
sulfate and heparanase for a time and under such conditions sufficient to allow 
inhibition of heparanase activity; 

c) detecting inhibition of heparanase activity (a compound that gives 50% 
inhibition at a concentration of 1 µM or less); and 

d) selecting compounds that inhibit heparanase activity. 
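
By way of illustration only, the selection criterion of steps (c) and (d) could be evaluated as in the following Python sketch. The claim states only the threshold (50% inhibition at 1 µM or less); the way percent inhibition is computed here, from released radiolabel counts relative to an uninhibited control and a no-enzyme background, is an assumption, and every name in the code (`AssayResult`, `percent_inhibition`, `select_inhibitors`) is hypothetical.

```python
from dataclasses import dataclass
from typing import List

THRESHOLD_INHIBITION = 50.0  # percent, as recited in step (c)
MAX_CONCENTRATION_UM = 1.0   # micromolar, as recited in step (c)


@dataclass
class AssayResult:
    compound: str                # hypothetical compound identifier
    concentration_um: float      # test concentration in micromolar
    released_cpm: float          # radiolabel released with compound present
    control_cpm: float           # radiolabel released without compound
    background_cpm: float = 0.0  # counts without enzyme (assumed correction)


def percent_inhibition(result: AssayResult) -> float:
    """Percent loss of heparanase activity relative to the control."""
    signal = result.control_cpm - result.background_cpm
    if signal <= 0:
        raise ValueError("control must exceed background")
    remaining = (result.released_cpm - result.background_cpm) / signal
    return 100.0 * (1.0 - remaining)


def select_inhibitors(results: List[AssayResult]) -> List[str]:
    """Keep compounds giving at least 50% inhibition at 1 uM or less."""
    return [
        r.compound
        for r in results
        if r.concentration_um <= MAX_CONCENTRATION_UM
        and percent_inhibition(r) >= THRESHOLD_INHIBITION
    ]


if __name__ == "__main__":
    # Made-up example values, not data from the disclosure.
    demo = [
        AssayResult("A", 1.0, 400.0, 1000.0, 100.0),
        AssayResult("B", 1.0, 800.0, 1000.0, 100.0),
        AssayResult("C", 10.0, 200.0, 1000.0, 100.0),
    ]
    print(select_inhibitors(demo))
```

In this made-up example only compound A passes: B inhibits too weakly, and C is tested above the 1 µM limit.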



2. A method according to claim 1 wherein the heparanase is recombinant. 

3. A heparanase having an isoelectric point of less than 5.5 and possessing activity greater 
than 20 units heparanase activity per µg protein. 

4. A heparanase according to Claim 3, having an isoelectric point of about 4.8 - 5.1. 



5. A heparanase purified to apparent homogeneity, as in claim 3, prepared in the presence of 
reducing conditions and activated with transglutaminase, having an amino acid sequence selected 
from the group consisting of SEQ. ID NO: 1, SEQ. ID NO: 3, SEQ. ID NO: 5 or SEQ. ID NO: 7. 

6. A heparanase purified to apparent homogeneity, as in claim 3, prepared in the presence of 
reducing conditions, having an amino acid sequence of SEQ. ID NO: 1. 

7. A heparanase purified to apparent homogeneity, as in claim 3, prepared in the presence of 
reducing conditions and activated with transglutaminase, having an amino acid sequence of 
SEQ. ID NO: 3. 

■ L 1 \ A A J , i ! \ ; I 1 ■ 



9. A heparanase, as in claim 3, prepared by recombinant means, activated with 
transglutaminase and having an amino acid sequence selected from the group consisting of 
SEQ. ID NO: 1, SEQ. ID NO: 3, SEQ. ID NO: 5 or SEQ. ID NO: 7. 

10. A heparanase, as in claim 3, prepared by recombinant means, activated with 
transglutaminase and having an amino acid sequence of SEQ. ID NO: 1. 

11. A heparanase, as in claim 3, prepared by recombinant means, activated with 
transglutaminase and having an amino acid sequence of SEQ. ID NO: 3. 

12. A heparanase purified to apparent homogeneity, as in claim 3, prepared in the presence 
of reducing conditions and activated with transglutaminase, and having an amino acid sequence 
selected from the group consisting of SEQ. ID. NO: 12, SEQ. ID. NO: 14, SEQ. ID. NO: 16, 
SEQ. ID. NO: 18, SEQ. ID. NO: 20, SEQ. ID. NO: 22 and SEQ. ID. NO: 24. 

13. A heparanase purified to apparent homogeneity, as in claim 3, prepared in the presence 
of reducing conditions and activated with transglutaminase, and having an amino acid sequence 
selected from the group consisting of SEQ. ID. NO: 26, SEQ. ID. NO: 28, SEQ. ID. NO: 30, 
SEQ. ID. NO: 32, SEQ. ID. NO: 34, SEQ. ID. NO: 36, SEQ. ID. NO: 38 and SEQ. ID. NO: 40. 

14. A method according to claim 1 wherein the heparanase is purified to apparent 
homogeneity, prepared in the presence of reducing conditions and activated with 
transglutaminase, and having an amino acid sequence selected from the group consisting of 
SEQ. ID. NO: 12, SEQ. ID. NO: 14, SEQ. ID. NO: 16, SEQ. ID. NO: 18, SEQ. ID. NO: 20, 
SEQ. ID. NO: 22 and SEQ. ID. NO: 24. 

15. A method according to claim 1 wherein the heparanase is purified to apparent 
homogeneity, prepared in the presence of reducing conditions and activated with 
transglutaminase, and having an amino acid sequence selected from the group consisting of 
SEQ. ID. NO: 26, SEQ. ID. NO: 28, SEQ. ID. NO: 30, SEQ. ID. NO: 32, SEQ. ID. NO: 34, 
SEQ. ID. NO: 36, SEQ. ID. NO: 38 and SEQ. ID. NO: 40. 



16. A method according to claim 1 wherein the heparanase is purified to apparent 

SEQ. ID NO: 1, SEQ. ID NO: 3, SEQ. ID NO: 5 or SEQ. ID NO: 7. 



17. A peptide having an amino acid sequence of SEQ. ID NO: 42. 